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Overtaken by events 


Despite the small number of entries, the genomics X prize is to be commended for attempting to 
push the boundaries of DNA sequencing technology. 


shows how far sequencing technology has come — and how far 

it still has to go. Barring any late surprises, only two teams will 
have signed up by the registration deadline of 31 May to compete for the 
US$10-million prize for the first to sequence 100 centenarians’ genomes 
in 30 days or less at a cost of $1,000 per genome (see page 546). The 
sequences must have no more than one ina million errors, be 98% com- 
plete and have correct haplotype phasing — a determination of which 
parent contributed each portion of a chromosome. It is not possible at 
present to meet this combination of goals with any single technology, 
but that does not explain why so few are reaching for it. 

One reason why more teams are not lining up for the prize is that the 
promise of a genomic medical revolution is not being stalled by any 
lack of data. At genetic-medicine conferences,such as the University of 
Californias OME 2013 precision-medicine conference, held on 2 and 
3 May in San Francisco, or the Big Data in BioMedicine conference held 
from 22 to 24 May at Stanford University in California, you will hear the 
same refrain. “We have more data right now than we know what to do 
with” Figuring out how to interpret genetic data — and, more crucially, 
how to prove their value to patients and health-care systems — is the 
most pressing challenge in genomics today. Researchers can already 
sequence the protein-coding regions of a genome for less than $1,000. 
Getting more data on regions of the genome that they do not yet know 
how to interpret will not help to advance the goal of proving the medical 
worth of big data. 

Interpretation and analysis — making sense of the data — is now 
the real prize. Hence the launch of a spate of bioinformatics chal- 
lenges (see page 547) as researchers compete to surmount that hur- 
dle. They include Sequence Squeeze, a contest to develop the best 
sequence-data-compression algorithm; the Assemblathon, for the best 
program to assemble a genome sequence from scratch; the DREAM 
Challenges to analyse and predict biological interactions among gene 
products; the CLARITY genome-interpretation challenge; and con- 
tests at the annual Beyond the Genome meeting. Michael Schatz at the 
Cold Spring Harbor Laboratory in New York, who has curated many 
of these contests, is planning more challenges this year, including one 
at Cold Spring Harbor later this autumn. Bioinformatics contests 
have the advantage that they do not require physical manufacturing 
infrastructure, so they are more accessible to more would-be solvers 
around the world. 

There are other reasons why the genome X prize is a harder sell 
than other X prizes. The sequencing field is much more mature than 
were other industries that have been the focus of successful X prizes. 
Whereas there was no space-tourism industry before dozens of teams 
competed for the Ansari X Prize in 2004, for instance, there is already 
a thriving commercial market for sequencing. So any company that 
could meet the goals laid out in the prize already has its incentive — 
and it would be worth a lot more than $10 million. The value of the 


Ts sparse list of contenders for the Archon Genomics X Prize 


market leader in sequencing, IIlumina of San Diego, California, is 
currently $8.8 billion. 

And it is very unlikely that anyone other than a well-financed lab or 
large company could attempt the current challenge. That also sets it 
apart from other competitions — the Google Lunar X Prize, for exam- 
ple — in which teams of professionals or even hobbyists can make a 
respectable showing. The thriving do-it-yourself biology movement, 
by contrast, cannot mounta credible challenge to the large life-sciences 

companies. The attempt is even beyond most 


“The promise biotech start-ups. The UK-based biotechnol- 
of a genomic ogy company Oxford Nanopore, for example, 
medical which is trying to commercialize a promising 
revolutionis not technology pioneered by highly respected 
being stalledby _ researchers, has raised at least $150 million in 
any lack grants and investment since 2008 — but has 
of data.” yet to show that its technology can be used to 


sequence a complete human genome. 

That is not to say that the genomics X prize does not matter. The 
X Prize Foundation should be commended for revising the challenge, 
initially laid out in 2006, as the field evolved. It has also done a valuable 
service by working for two years with many partners, including Nature 
Genetics, to outline a judging scheme that can independently assess the 
quality and accuracy of a genome sequence and that is agnostic about 
the sequencing technology used. The foundation deserves kudos 
for prompting the field to reach farther; if past history is any guide, 
genomicists will reach that goal sooner than now seems possible. m 


Still less equal 


Japan’s government must stick by its promise 
to help women’s careers to prosper. 


care centres in Yokohama, by far the highest number of any city in 
Japan. Over the next three years, the city’s (female) mayor, Fumiko 
Hayashi, spent 37 billion yen (US$362 million) on building new infra- 
structure, including 144 childcare centres. Now the waiting list is zero. 
Many female scientists, as well as women working in other sectors, 
celebrated the news. They know that help with child-rearing responsibil- 
ities is essential for a mother to have a successful career. But even better 
were the reverberations, which reached all the way up to the prime min- 
ister, with an indication that change might become more widespread. 
On 20 May, after touring one of the childcare centres with Hayashi, 
Prime Minister Shinzo Abe said that the “Yokohama model” should 


I: 2010, there were 1,552 children waiting to get places in child- 
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be applied across the country. In fact, an economic growth strategy 
report released in April called for childcare capacity to be increased by 
400,000 nationwide. 

Why the sudden focus on such a progressive issue from a man who 
refuses even to consider a popular amendment to Japanese law that 
would allow the imperial line to pass through female members of the 
family? The momentum for change seems to be coming largely from a 
recognition of economic and demographic realities. Japan's population, 
and its labour force, are shrinking, and its economic competitiveness 
is faltering. The country has been frittering away a resource that could 
help to meet these challenges. And the call is getting louder. 

In 2010, for example, the investment bank Goldman Sachs, head- 
quartered in New York, released a widely cited report on “Womenomics’ 
in Japan stating that closing the country’s gender gap — bringing the 
employment rate of women up to that of men — would increase the 
workforce by 8.2 million and boost gross domestic product by 15%. 

Such figures have grabbed the attention of business leaders. Com- 
panies are more likely to allow flexible working hours for mothers, for 
example; government officials are making women's issues part of their 
political platforms; and ‘Abenomics; as the prime minister's aggres- 
sive financial and economic plans are called, seems to be endorsing 
Womenomics (see page 548). But will the right improvements be made? 

The problem facing women in Japan has nowhere been as glaring as 
in science and engineering. The country has taken steps to remedy the 
situation, but there is much more to be done. In 2006, for example, the 
science and education ministry brought in a programme to improve 
support systems for female scientists. Renewed in 2011, the programme 
has so far supported projects at 88 universities and research institutions. 

These projects led to increased on-campus childcare facilities and 
practical support for scientists with children. Most universities now 
have at least one childcare centre. Another programme, launched by the 
science ministry in 2009, and called Supporting Positive Activities for 
Female Researchers, has funded 5-year programmes at 12 universities 


to increase the number and promotion rate of female faculty members. 
These ‘affirmative action programmes have ramped up the number of 
female scientists, especially at higher-level positions. 

And such policies have created momentum. At institutions that 
implemented the plans, the proportion of women among the research 
staff grew from 12.5% in 2005 to 15.4% in 2011. Nationwide fig- 
ures have risen from 11.9% to 13.8% over the same period. But this 

increase is too slow. Japan still has the lowest 


“The Japanese number of females as a percentage of total 
government science researchers in developed countries, 
seems to be according to figures collected by the science 
waking up ministry. A new innovation policy from the 
to women’s Council for Science and Technology Policy, 

tential currently in draft form, calls for the percent- 
io ortance to age of female new hirings at universities and 


research institutes to reach 30% by 2016. It 
is an ambitious goal, considering that this 
figure now is just over 20%. 

Ensuring equal opportunities for women scientists is a global prob- 
lem. But Hisako Ohtsubo, a molecular biologist at Nihon University 
near Funabashi, says that Japan is more than 25 years behind some coun- 
tries, noting that the US National Science Foundation brought in such 
policies in the 1980s. To catch up, the government should expand its pro- 
grammes, which have so far been short-term and on too limited a scale, 
and ensure that they encourage women not only to engage in science 
but also to fight for leadership positions in the scientific community. 

With competition growing, especially from China and South Korea, 
Japan's position as a scientific power is no more secure than its status 
as an economic power. The Japanese government seems to be waking 
up to women's potential importance to the economy. Making the most 
of their talent could be just as potentially transforming for Japanese 
science. The Abe government needs to stick by his promises and take 
the targets seriously. = 


the economy.” 


Without borders 


Increasing scientific globalization is welcome, 
but could compromise national efforts. 


gested, “if the various countries of the world would occasionally 
swap history books, just to see what other people are doing with 
the same set of facts.” 

History is famously written by the winners, and does tend to huddle 
inside national boundaries. Science is different, or so researchers like 
to believe. It is an international process. Facts are shared. Differences 
are ironed out. Scientists know what other people are doing. Correct? 

Yes and no. As an impressive analysis of some 25 million research 
papers ina Comment on page 557 shows, international collaboration 
plays an increasing part in twenty-first-century research. The growth 
in scientific output from the United States and western Europe in the 
past three decades, for example, is entirely down to international col- 
laboration. The number of journal articles that feature authors entirely 
from a single country in those regions has remained the same. 

Only because of global links can the UK government claim, as it did 
in a 2011 report, that the country creates “14 per cent of the world’s 
highly cited output” with just “4 per cent of the world’s gross expendi- 
ture” on research and development. Most of those high-impact papers 
include the contribution of a foreign scientist. In fact, in 2010, the 
number of ‘British’ papers produced with help from abroad exceeded 
those made entirely in Britain for the first time. 

According to the US National Science Foundation, almost 


cc I: might be a good idea,’ the US journalist Bill Vaughan once sug- 
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one-quarter of global research articles in 2010 featured authors from 
more than one country, up from 10% in 1990. The average number of 
authors has doubled since 1980 and now stands at 4.5. 

Although Nature welcomes the global reach of science, we have 
previously pointed out that internationalization will rub up against 
some natural boundaries. National pride and prestige matter, not least 
because science is still mostly funded and managed on a national basis. 

Last October, in a special issue on the globalization of science, we 
said: “Mobility cannot stretch infinitely: relationships, families and 
quality of life put limits on how much researchers want to travel, and 
for how long” (Nature 490, 309-310; 2012). And we pointed out that 
collaboration could blur the borders of national priorities, especially 
for countries that are just beginning to develop their science bases. 

The science bases of some of the bigger developing economies are 
coming along just fine, thank you. As the Comment points out, most 
of the scientific growth in China, Brazil and South Korea is driven by 
domestic work. And quality is rising. The citation impact of more than 
10% of China's domestic research is greater than twice the world average. 

What does this mean? A change in international dynamics for start- 
ers, as Jonathan Adams points out in the Comment: “The older econo- 
mies can no longer rely on the best foreign researchers to come and visit?” 

There are opportunities galore in this new world, but Adams also 
highlights some of the threats. The best institutions in different coun- 
tries already tend to work together the most. Others can find it hard 
to join this global competition. “There is a growing divide between 
international and domestic research,” Adams notes. “This will influ- 
ence each nation’s ability to draw on the global 
knowledge base, and could in turn compromise 
national scientific wealth.” It is not just between 
nations that science must be shared, but within 
their borders too. m 
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scored a goal in a World Cup quarter final against Germany. 

The ball hit the crossbar and bounced down way over the line, 
before it was cleared by a grateful goalkeeper. The referee and his assis- 
tants failed to award the goal, because they could not be sure the ball 
had crossed the line. 

Every fan of football (apologies to US readers but I refuse to use 
the term soccer) has their own story ofa goal that should or shouldn't 
have been given. Those in charge of the game have finally bowed to 
public pressure and next month’s Confederations Cup in Brazil will see 
goal-line technology used in an international tournament for the first 
time. In August, a similar system will be introduced into the English 
Premier League, the richest and highest-profile league in the world. 
The companies involved are already discussing how their machines 
could be applied to help football referees make 
offside, handball and other decisions. Sports offi- 
cials in the United States and Canada are watch- 
ing with interest. 

Yet the introduction of goal-line technology to 
football is likely to perpetrate a mass deception on 
television viewers. It will miss a huge opportunity 
to educate people about the role of uncertainty in 
science. And it will exacerbate the approaching 
danger of fake computer-generated video footage. 

Have you seen how a tennis player’s challenge 
of a close line call is adjudicated by the Hawk- 
Eye system? I confess that until recently I was 
one of the vast majority who believed they were 
watching an accurate replay of reality. In fact, the 
system combines multiple high-speed camera 
images to estimate the ball’s trajectory and position when it hits the 
ground. Nothing in the graphics indicates to viewers the inevitable 
uncertainty and potential for error in that estimation. As it stands, the 
same will apply to goal-line replays. 

Fans are likely to share a misunderstanding that apparently 
reaches right to the top of the game. Announcing the introduction of 
Hawk-Eye to football in April, the Premier League's chief executive, 
Richard Scudamore, said that the technology would reveal “defini- 
tively” whether the ball had crossed the line. 

Scudamore may not have read the 1726 edition of Isaac Newton's 
Philosophice Naturalis Principia Mathematica, which stresses the role 
of uncertainty in the scientific method. Measurement involves error. 
Results are accompanied by confidence limits and levels. With public 
opinion key to decisions on topics such as climate change, nuclear power 
and genetic modification, calls to spread aware- 


A Imost three years ago, the English footballer Frank Lampard 


ness of uncertainty and probability in scienceare DNATURE.COM 
increasing, even if they are often drowned out by _ Discuss this article 
the comforting simplifications of certainty that _ online at: 
characterize mainstream reporting. go.nature.com/3upjdk 


THE INTRODUCTION 
OF GOAL-LINE 


TECHNOLOGY 
TO FOOTBALL 
IS LIKELY TO 
PERPETRATE A MASS 


DECEPTION. 


Tell fans definitive calls 
are an impossible goal 


The electronic eyes that will watch football goal lines this summer may settle 
arguments, but they still depend on probability, says Nic Fleming. 


In 2011, sociologists at Cardiff University, UK, published a paper 
on the use of Hawk-Eye in sports (H. Collins & R. Evans Public 
Understanding of Science http://doi.org/frmw94; 2011). The system 
introduced a “false transparency” in tennis, they said, and should be 
supplemented with visual error bars or numerical statements of con- 
fidence in the screened Hawk-Eye graphics. 

Cricket fans already understand this. In leg-before-wicket decisions 
(sorry again US readers — look it up) there is a ‘zone of uncertainty’ 
where Hawk-Eye admits that it cannot be certain whether or not the 
ball would have hit the stumps. It defers to the judgement of the human 
umpire and this is acknowledged to television viewers with an ‘on-field 
cal? message. Football fans will be denied such information. 

Both Hawk-Eye and the (separate) GoalControl technology that will 
be used in the Confederations Cup train seven high-speed cameras 
on each goal. Computer software then combines 
the two-dimensional images into three-dimen- 
sional representations of the ball and its position. 
Both systems can probably provide more accu- 
rate decisions than human referees. Hawk-Eye 
claims a minimum accuracy of + 6 millimetres 
whereas GoalControl claims + 5 millimetres. 
These claims cannot be verified as the Interna- 
tional Federation of Football Associations refuses 
to release the test results, beyond confirming that 
both systems meet its minimum requirement of 
+3centimetres in tests of balls moving at 70 kilo- 
metres per hour. 

The public is largely aware that still images can 
be airbrushed, improved and otherwise manip- 
ulated. Many media organizations have strict 
codes governing digital tinkering. Earlier this year, North Korea was 
seemingly caught out cutting and pasting extra hovercrafts into an 
image to boost its apparent military muscle. 

Moving images can seem more realistic. Yet we are rapidly 
approaching a time when computer-generated moving images will 
look as if they have been filmed. We will be able to create entirely 
faked video footage. This is useful to create an impressive mountain 
backdrop for a film. But what about when political propagandists or 
those with commercial motives start to exploit such technology? 

If video footage is to retain credibility, we need greater transparency 
about visual representations of events, so we can distinguish replays 
from reconstruction. On-screen honesty on the limitations of goal-line 
technology would do this, and put a great many people in the picture 
about science and uncertainty. Football is only a game but it is a good 
place to start. = 


Nic Fleming is a freelance science and technology journalist in 
London and a Sheffield United fan. 
e-mail: nicfleming106@hotmail.com 
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RESEARCH HIGHLIGHTS 


| _NEUROSCIENCE 
Bee exploration 
mechanism 


Honeybees orient using similar 
molecular pathways to many 
vertebrates. 

Gene Robinson and Claudia 
Lutz at the University of 
Illinois at Urbana-Champaign 
identified a protein in 
honeybees (Apis mellifera) 
similar to the transcription 
factor Egr1, which is 
expressed in the brains of 
vertebrates such as rodents 
during the exploration of new 
environments. 

Egr was upregulated only in 
regions of the bee brain called 
mushroom bodies — which 
integrate sensory input and 
process memories — and only 
as the bees learned to orient 
in unfamiliar surroundings. 
This was true of both young 
bees that had previously 
never left the hive and 
experienced foragers placed 
in anewenvironment. The 
results demonstrate the deep 
evolutionary conservation 
of Egr-related molecular 
pathways in experience- 
dependent learning, say the 
authors. 

J. Exp. Biol. 216, 2031-2038 
(2013) 


| GENOMICS 
lrish-famine 
pathogen decoded 


Researchers have sequenced 
the genome of the 
microorganism that devastated 
the Irish potato crop in the 
1840s — the first time the 
genome ofa historical plant 
pathogen has been decoded. 

Kentaro Yoshida at the 
Sainsbury Laboratory in 
Norwich, UK, and his 
colleagues identified the strain 
of Phytophthora infestans, 
which causes potato late blight, 
from nineteenth-century 


GEOLOGY 


Gemstones from the deep 


The precious stones jade and ruby can be used to identify the 
sites of ancient collisions of tectonic plates. 

Robert Stern at the University of Texas at Dallas and his 
colleagues suggest that these two substances could be called 
“plate tectonic gemstones”. Jadeitite, a type of jade, forms 
where one tectonic plate plunges beneath another. Fluids that 
rise from the diving slab of oceanic crust condense to form 
the gemstone (pictured in an Olmec mask from southern 
Mexico — the site of one such subduction zone). By contrast, 
ruby forms where continental crust rich in aluminium 


collides, as in Southeast Asia. 


These stones should be recognized not only for their 
beauty but also for what they reveal about their tectonic 


setting, the authors say. 


Geology http://dx.doi.org/10.1130/G34204.1 (2013) 


dried leaves. The researchers 
sequenced DNA from 11 
historical specimens, which 
had been stored in herbaria, 
and from 15 modern strains 
of P. infestans. The famine 
strain was closely related to 
another strain that is still 
prevalent around the world, 
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and the researchers suggest 
that the two strains diverged 
in the early 1800s. The famine 
strain may now be extinct, the 
authors say. 

eLIFE http://dx.doi.org/10.7554/ 
elife.00731 (2013) 

For a longer story on this research 
see go.nature.com/qg2aov 
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Selections from the 
scientific literature 


Footprints reveal 
hominin size 


Fossil footprints indicate that 
hominins were already as 
large as modern humans by 
1.52 million years ago. 
Undamaged fossil 
skeletons from that time are 
rare, and so determining 
characteristics such as the 
size and walking speed of 
human ancestors has been 
challenging. Brian Richmond 
and Heather Dingwall 
at George Washington 
University in Washington 
DC and their colleagues 
measured foot size and stride 
length from fossil footprints 
of seven individuals — which 
were probably Homo erectus 
or Paranthropus boisei — 
discovered in northern 
Kenya. To translate these 
measurements into physical 
attributes such as stature, 
body mass and walking 
speed, the researchers 
studied the relationship 
between body dimension and 
gait in habitually barefoot 
modern adults from Kenya. 
The authors were then able 
to infer that the size of these 
hominins was comparable to 
that of modern humans. 
J. Hum. Evol. 64, 556-568 (2013) 


CHEMISTRY 


Metabolites, 
cell by cell 


Single-cell measurements 
are revealing how individual 
yeast cells react to 
environmental and genetic 
challenges. 

Advances in mass 
spectrometry, which 
identifies individual 
compounds in complex 
mixtures, have given 
researchers the chance to 
compose cell-by-cell portraits 
of metabolism. Renato 


PETER HORREE/ALAMY 


SEVEN DAYS 


MRI spy charges 


Three Chinese researchers 
working in the United 

States have been charged 

with commercial bribery, 
announced the Federal Bureau 
of Investigation and the US 
Attorney’s Office in Manhattan 
last week. Officials say that 
radiology professor Yudong 
Zhu recruited Xing Yang and 
Ye Li to help him develop 
magnetic resonance imaging 
(MRI) technology at New York 
University’s Langone Medical 
Center. While funded by a 
multimillion-dollar grant from 
the US National Institutes of 
Health, the three allegedly 
took payments from a rival 
medical-imaging company and 
a research institute in China 

in exchange for confidential 
information about their 
research. 


Spain unrest 

In response to steep budget 
cuts, Spanish scientists, 
university officials and 

science advocates have called 
for protests in defence of 
science in the country. Ina 

21 May document, the Open 
Letter for Science movement 
urged participation in 
demonstrations on 14 June 
against what it described as the 
ruin of the Spanish research 
system. Spain has reduced 

its state budget for science 

by roughly 40% since 2009, 
and research funds for 2013 
are being delayed. The group 
asks the government to boost 
science funding and implement 
measures to retain the country’s 
scientific talent. 


Lobbying lull 

Citing huge advances under 
US President Barack Obama 
in the availability of human 
embryonic stem cells for 
US-funded researchers, one 
key advocacy group says that 
its work is done. The Coalition 


The news in brief 


Eye of the storm 


The National Oceanic and Atmospheric 
Administration (NOAA) has predicted an 
extremely active Atlantic hurricane season, six 
months after Hurricane Sandy devastated the 
US northeast (pictured). In its forecast released 
on 23 May, the agency says there is a 70% chance 
that 7-11 hurricanes, including 3-6 major ones, 
will form between 1 June and 30 November. 


for the Advancement of 
Medical Research, based in 
Washington DC, announced 
its dissolution on 21 May, after 
lobbying for federal funding 
for the controversial research 
since 2001. The news comes 
nearly five months after the US 
Supreme Court refused to hear 
a legal challenge that attempted 
to stop government support 

of the research (see go.nature. 
com/D5jegit). 


Deep shocks 

Eastern Russia was struck 
bya deep, magnitude-8.3 
earthquake off the Kamchatka 
peninsula on 24 May. The 
epicentre was located 

610 kilometres below Earth's 
surface in the Sea of Okhotsk 
and about 400 km northwest 
of the city of Petropavlovsk- 
Kamchatsky, according to 
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the US Geological Survey. 
Data suggest that the quake 
ruptured a deep fault in the 
Pacific plate, which is being 
forced under neighbouring 
plates. See go.nature.com/ 
kipwjv for more. 


Remodelled sub 
With upgrades complete, the 
US flagship submersible Alvin 
is en route to Astoria, Oregon, 
from where it will return to 
exploring the dark depths 

of the oceans in December. 
Alvin left the Woods Hole 
Oceanographic Institution 

in Massachusetts on 25 May 
aboard the research ship 
Atlantis, after being taken 

out of service in 2010 fora 
US$41-million makeover. 
Among its improvements is 
anew titanium sphere — in 
which researchers sit during 
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“There are no mitigating factors that we can see 
that would suppress the activity,’ says Gerry Bell, 
lead hurricane forecaster for NOAA. But the 
agency says that a new supercomputer and an 
updated forecasting model — fed with real-time 
Doppler radar data collected by aeroplanes 

that fly through storms — should improve the 
accuracy of its hurricane tracking. 


dives — equipped with extra 
windows and high-definition 
cameras. See go.nature.com/ 


mdgxhy for more. 


Antibiotics deal 


In the first partnership of its 
kind, the US government 
will pay GlaxoSmithKline 
up to US$200 million to 
develop drugs to combat 
antibiotic resistance and 
bioterrorism, the London- 
based pharmaceutical giant 
announced on 22 May. 

The Biomedical Advanced 
Research and Development 
Authority, a federal office 
established to address public- 
health emergencies and 
biodefence threats, will pay 
the company $40 million 
over 18 months and up to 


ROBERT SIMMON/GOES PROJECT SCIENCE TEAM/NASA/NOAA 
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$160 million more if the 
contract is renewed over 
5 years. 


Sleep aid 

On 22 May, advisers to US 
drug regulators recommended 
approval ofa first-in-class 
therapy for insomnia, but 
proposed its use only at 

low doses. The drug, called 
suvorexant, is made by Merck, 
a pharmaceutical company 

in Whitehouse Station, New 
Jersey. Ifapproved, it will be the 
first insomnia drug to block 
orexin, a neurotransmitter that 
regulates sleep-wake cycles. 
Reviewers at the US Food 

and Drug Administration 
expressed concern about the 
side effects seen at higher 
doses, including suicidal 
thoughts and narcolepsy-like 


symptoms. 


ee PEDRLE S| 
Shaw prizes 


Six scientists in Britain and 

the United States won this 
year’s Shaw prizes, each worth 
US$1 million. Jeffrey Hall at the 
University of Maine in Orono, 
Michael Rosbash at Brandeis 
University in Waltham, 
Massachusetts, and Michael 
Young at the Rockefeller 
University in New York share 
the prize in life sciences and 
medicine for work on circadian 
rhythms. Steven Balbus 
(pictured) at the University of 
Oxford, UK, and John Hawley 


TREND WATCH 


The United Nations is leading 
a push to give the entire world 
access to electricity and less- 
polluting cooking fuel by 


\ a 


at the University of Virginia 
in Charlottesville won in 
astronomy for work on disks 
of gas and dust around stars. 
The mathematics prize went 
to David Donoho of Stanford 
University in California, for 
advances in statistics. 


| RESEARCH 
Stem-cell censure 


Criticisms of apparent data 
duplication have tarnished 

a recent study in Cell that 
reported the creation of 
human stem-cell lines through 
cloning. An anonymous 
online commenter noted 
apparent duplicated images 
and scatterplots in the paper 
by Shoukhrat Mitalipov of the 
Oregon Health and Science 
University in Portland, and 
his colleagues. Mitalipov says 
that the duplications were 
innocent mistakes, and is 
working with Cell to correct 
them. See page 543 for more. 


Swiss biotech site 


Plans for a major Swiss 
biotechnology initiative 

have moved forward with 

the purchase ofa host site, 
announced on 22 May. The 
Campus Biotech consortium, 
backed by billionaires Hansjérg 
Wyss and Ernesto Bertarelli, 
has bought the 45,000-square- 
metre former Merck Serono 
facility in Geneva, Switzerland, 
for an undisclosed amount. The 
group plans to open an institute 
for bio- and neuroengineering 
in collaboration with the Swiss 
Federal Institute of Technology 
in Lausanne and the University 
of Geneva. See go.nature.com/ 
bxaqxp for more. 


Making waves 

Plans for the world’s largest 
wave-energy farm, off the 
northwest coast of Lewis 

in Scotland, were given the 
all-clear by the Scottish 
government on 22 May. 
Edinburgh-based firm 
Aquamarine Power will install 
40-50 wave-energy devices, 
with an overall power capacity 
of about 40 megawatts once 
electricity cables are installed, 
which will probably be 2017 at 
the earliest. 


Arctic break-up 
Fracturing Arctic ice has 
prompted Russia to order the 
evacuation of its North Pole 40 
research station, which collects 


WIDER ACCESS TO CLEAN ENERGY NEEDED 


World falling short of 2030 target to expand global access to 


electricity and clean household fuels. 
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2030. The agency also aims for 
renewables to constitute a greater 
share of energy consumption. 
But initial statistics released on 
28 May ina Global Tracking 
Framework report (see chart) 
show that those targets will 

be hard to reach, mainly 

because rising populations and 
economic growth are countering 
improvements. See go.nature. 
com/gyowo6 for more details. 
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SEVEN DAYS | THIS WEEK | 


1-5 JUNE 

The interaction between 
human activities and 
the environment takes 
centre stage at the third 
World Summit on 
Evolution, held in the 
Galapagos Islands. 
go.nature.com/reklva 


2-6 JUNE 

Updates on the search 
for habitable worlds 

and results from the 
Curiosity Mars rover are 
among the programme 
highlights of the 222nd 
American Astronomical 
Society meeting in 
Indianapolis, Indiana. 
go.nature.com/2mqa71 


meteorological observations 
and monitors pollution. 

The ice floe that hosts the 
16-crew station has broken 
into six pieces, according to 
news reports last week. In a 
statement, the environment 
ministry expressed concern 
for staff safety, and noted the 
possible risk of environmental 
contamination near Canada’s 
economic zone in the event of 
a disaster. 


POLICY 


Water priorities 
The US Agency for 
International Development 
released its first global water 
and development strategy on 
21 May. The foreign-assistance 
agency already spends about 
US$452 million each year on 
water-related programmes, 
but said that the new plan 
“elevates the importance” 

of water as a priority for the 
group. From this October, 
the organization will focus all 
new water programmes on 
two goals: improving human 
health and enhancing food 
security by providing clean, 
sustainable water. 


> NATURE.COM 
For daily news updates see: 
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STEM CELLS 


Shoukhrat Mitalipov, lead author of human stem-cell cloning paper, admits errors. 


Fallout from hailed 
cloning paper 


Duplicated figures raise debate over expedited publication. 


BY DAVID CYRANOSKI 


ow fast is too fast for review of a scien- 
H tific article? And who has the respon- 

sibility to ensure accuracy? Errors 
found in a widely acclaimed cloning study have 
rekindled those questions — and sent the lead 
author and the journal that published it scram- 
bling to assure the world that the problems did 
not compromise the findings. 

The paper, which was published online by 
the journal Cell on 15 May (http://doi.org/ 
mkn), reported the creation of human embry- 
onic stem-cell lines from cloned human skin 
cells. The lines are expected to answer funda- 
mental questions about the way in which cells 
are reprogrammed and also to have potential 


therapeutic applications. 

But last Wednesday, after an anonymous 
online commenter noted three pairs of dupli- 
cated images with conflicting labels in the 
paper, excitement turned to confusion — and 
a bit of déja vu. The last time the same feat was 
claimed — by then Seoul University profes- 
sor Woo Suk Hwang — duplicate images were 
noted anonymously and the breakthrough was 
later debunked. Nobody is claiming more than 
sloppiness in the present case, and the authors 
quickly stepped up to 


put the record straight. | NATURE.COM 
The leader of the Formoreon 

study, Shoukhrat Mital- __ the cloning 

ipov, a reproductive- _ breakthrough go to: 

biology specialist at the —_go.nature.com/tejhix2 
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Oregon Health and Science University in Port- 
land, acknowledges that the team did use three 
sets of duplicated images. In two cases that was 
intentional, but a pair of labels was mistakenly 
reversed, he says (see http://doi.org/mnk). 

In the third case, an incorrect scatterplot 
was included in a table examining the pat- 
terns of gene activity in the cloned stem-cell 
lines — one that had already been used in 
the paper. 

Mitalipov says that corrections will be made. 
He waves off another criticism raised by the 
anonymous commenter concerning the sur- 
prisingly tight shape of two scatterplots: they 
just turned out that way, he says. 

“The results are real, the cell lines are real, 
everything is real,’ he says. 

The editor-in-chief of Cell, Emilie Marcus, 
released a statement in response to the criti- 
cisms, defending the paper: “Based on our own 
initial in-house assessment of issues raised and 
in initial discussions with the authors, it seems 
that there were some minor errors made by the 
authors when preparing the figures for initial 
submission. While we are continuing discus- 
sions with the authors, we do not believe these 
errors impact the scientific findings of the 
paper in any way.” 

Most researchers seem ready to give Mital- 
ipov the benefit of the doubt, on the basis of his 
answers and his track record. “The explana- 
tions [by Mitalipov] are plausible, but we will 
have to wait for the results of a thorough inves- 
tigation,” says Martin Pera, a stem-cell expert 
at the University of Melbourne, Australia. 

Robin Lovell-Badge, a developmental biolo- 
gist at the Medical Research Council's National 
Institute for Medical Research in London, 
also accepts that the errors resulted just from 
sloppiness. “I really don’t think in this case it’s 
any attempt to manipulate. It’s not the data 
you would want to manipulate, anyway,” 
he says. 

But many also noted that the paper had 
been published with blazing speed — Cell 
accepted it just three days after receiving it 
and published it online 12 days later — and 
questioned whether such rapid publication is 
good for science. “Whatever the explanation is, 
it’s amazing that there is another issue with a 
paper in SCNT [somatic-cell nuclear transfer]. 
The four-day review process was obviously 
inadequate,’ says Arnold Kriegstein, director 
of the stem-cell programme at the University 
of California, San Francisco. > 
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>  Lovell-Badge says that the mistakes 
still should have been caught, by both the 
authors and the reviewers and the journal, 
particularly one staffed by a team of pro- 
fessional editors. “It’s the author’s problem 
but if it’s a journal like Cell or Nature, then 
I think, the journal, they have to take some 
responsibility to make sure there are no 
errors,’ he says. 

Veronique Kiermer, executive editor at 
Nature Publishing Group, says that “this 
type of image manipulation is not some- 
thing we expect referees to pick up. It’s not 
really their role.” She says that the Nature 
journals do “spot-checking” — on aver- 
age, of two papers per issue. “But duplica- 
tions between different figures, or between 
supplementary figures and the main text, 
are difficult to pick up,” she says. 

Mike Rossner, former executive director 
of the Rockefeller University Press, imple- 
mented image screening as part of the 
press’s review process in its journals, such 
as the Journal of Cell Biology, in 2002. He 
says that all journals should do the same. 
Rossner says that inappropriately manipu- 
lated images appeared in as many as 25% of 
articles checked, although only about 1% 
affected the interpretation of the data. “I 
cannot speculate on whether our image- 
screening process would have caught these 
particular issues [in Mitalipov’s Cell paper], 
but the process does include a visual 
check for duplicated regions in a figure,” 
he says. 

Marcus defends the fast peer review: “It 
is a misrepresentation to equate slow peer 
review with thoroughness or rigor or to 
use timely peer review as a justification for 
sloppiness in manuscript preparation,” she 
wrote in her statement. 

Mitalopov admits that a rush to publish 
might have led to mistakes that he com- 
pares to “typos. But if he had to do it over, 
he says he wouldn't take any more time or 
do anything differently — other than to 
“make sure the errors weren't there.” 

“We had four cell lines in December. 
We had five months to put together data. 
Most images and most figures were ready 
months in advance. The project was done, 
completed, written,” he says. 

“If you have a paper that’s that hot, it will 
go through quickly. Other projects will take 
more time,’ he says. “It’s better than [it] sit- 
ting on someone's desk for 6 months.” 

Mitalipov is arranging to give about 
a dozen other groups access to the cells, 
which would allow them to validate the cell 
lines. “The first thing we want to do is have 
people confirm our results,” he said. “We 
are not hiding these cell lines?” = 


Additional reporting by Ewen Callaway, 
Erika Check Hayden and Richard Van 
Noorden. 


544 | NATURE | VOL 497 | 30 MAY 201 


3 


Immunotherapy’s 
cancer remit widens 


Combination therapies hold great promise, but at what cost? 


BY HEIDI LEDFORD 


rugs that unleash the power of the 
D immune system on cancers are gener- 

ating considerable optimism in indus- 
try, but still Andrew Baum thinks analysts are 
selling them short. In a 22 May report, Baum, 
the London-based head of global health-care 
research at the investment bank Citi, forecasts 
that in ten years the drugs will be treating 60% 
of cancers and earning US$35 billion a year. 

Three elements contribute to his bullish- 
ness: the drugs are showing signs of wider 
effectiveness; many patients will take them for 
years; and the prices are stratospheric (see ‘Stiff 
medicine). 

One of the first such drugs to be approved, 
Yervoy (ipilimumab), costs about $40,000 
per month in the United States, and £15,000 
($23,000) in the United Kingdom, where 
health-care officials negotiated a lower price. 
So far, expensive immunotherapies have 
been approved only for treating melanoma 
and prostate cancer. But this weekend, at the 
annual meeting of the American Society of 
Clinical Oncology in Chicago, Illinois, investi- 
gators will present promising results from tri- 
als that indicate that immunotherapies could 
soon have a role in treating cancers of the lung, 
kidney and stomach. 

Anew strategy will add to the costs: regimens 
that combine the drugs with radiation therapy, 
genetically targeted drugs and other immuno- 
therapies. The hope is that these other treat- 
ments will enhance the ability of the immune 
system to recognize the tumour, either by 
further stimulating the immune system, or by 


STIFF MEDICINE 


The cost of treating cancer is surging, 
with immunotherapies at the fore. 
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damaging the tumour so as to release antigens 
that the immune system recognizes. 

Yervoy, made by Bristol-Myers Squibb, 
headquartered in New York, is remarkable 
because a three-month course can send cancer 
into remission for years (see go.nature.com/ 
kle3m2). However, only about one-quarter of 
patients with advanced melanoma responds to 
the drug, which spurs tumour-killing T cells 
into action by blocking an inhibitory signal. 
“When Yervoy was first approved in 2011, we 
all recognized that it was an important moment 
for the field” says Jedd Wolchok, an oncologist 
at the Memorial Sloan-Kettering Cancer Center 
in New York. “But it was also a call to do better” 

On 2 June, Wolchok will present data from 
a trial that combined Yervoy with an experi- 
mental drug called nivolumab, also made by 
Bristol-Myers Squibb. It releases a second 
brake on the immune system by stifling a dif- 
ferent protein. The number of patients in the 
trial was small, but the results suggest that mel- 
anomas shrank in half of those who received 
the highest two doses tested. 

Combination approaches could expand the 
number of cancers that respond to immuno- 
therapy. But they could also accelerate the rise 
in the cost of cancer care. US spending on can- 
cer drugs is rising by some 15% a year, twice as 
fast as heath-care costs overall. “Cancer is a very 
complicated and expensive disease,” says Scott 
Ramsey, a health-care economist at the Fred 
Hutchinson Cancer Research Center in Seattle, 
Washington. “But now it’s turning into a chronic 
disease, and we're talking about years of main- 
tenance therapy with drugs that cost $10,000 
a month,’ However, not all of the inflation is 
due to expensive new drugs, says Ira Klein, a 
medical director at insurance company Aetna, 
based in Hartford, Connecticut. The price of 
radiation therapy is rising by 25% per year, he 
notes, driven by new technologies that can cost 
$100,000 or more for a full course of treatment. 

Drug developers attribute the high cost 
of the drugs to the expense of research and 
development, which is compounded by the 
industry’s high failure rate. Wolchok notes that 
drugs such as Yervoy have a small market, and 
speculates that prices may fall once their mar- 
kets expand to other cancers. But Peter Bach, 
who studies health-care policy at Memorial 
Sloan-Kettering, is sceptical that industry will 
cut prices. “I have never seen that,’ he says. “I 
have only seen the opposite. = 


PETER BACH 
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Long-lived insects 
raise prime riddle 


Mass orgy of 17-year cicadas sets US researchers buzzing. 


BY RICHARD MONASTERSKY 


rivers who end up behind John Cooley 
D this week will quickly lose their 

patience. Cruising around the eastern 
United States with his car window open, he 
slows down or stops every few hundred metres, 
cocks an ear and taps ona data-logger strapped 
into the passenger seat. 

Since last week, Cooley, an evolutionary 
biologist at the University of Connecticut in 
Storrs, has been on the road mapping popu- 
lations of periodical cicadas (Magicicada). 
These loud, red-eyed insects have spent the 
past 17 years maturing underground, only 
to emerge this month by the billions for a 
few weeks of singing and sex before they die. 
Like a handful of other cicada researchers on 
the prowl from North Carolina to New York, 


> 


MORE 
ONLINE 


TOP STORY 


Cooley knows that he has to work quickly. 
“Time is the real enemy here, for both the 
cicadas and the researchers,” he says. “If you 
miss this opportunity, you have a hole in 
your map and you have to wait for another 
17 years.” 

The insect genus with the longest known 
life cycles, Magicicada has confounded scien- 
tists for centuries. In 1665, the first volume 
of Philosophical Trans- 
actions of the Royal — 
Society included a 
report from New 
England concerning 


Periodical cicadas 
emerge in broods of 
up to 350 insects per 
square metre. 


| MORE NEWS | 
Emerging @ Unusually strong deep- 
H7N9 mantle earthquake strikes off 
flu easily coast of Russia go.nature.com/kipwjv 
transmits @ Graphene produces ultrashort 
between laser pulses go.nature.com/nldahd 
ferrets go.nature. | @ Protein has key role in specialized 
com/pzwpst itch neurons go.nafure.com/8pmbec 


© 2013 Macmillan Publishers Limited. All rights reserved 


IN FOCUS | NEWS 


“swarms of strange insects, and the mischiefs 
done by them” Charles Darwin also puzzled 
over them. Even now, entomologists are try- 
ing to understand how the insects’ peculiar 
life cycles evolved, how they count the years 
underground and how they synchronize their 
schedules. “They are one of the big ecologi- 
cal mysteries out there,” says Walt Koenig, a 
behavioural ecologist at the Cornell University 
Lab of Ornithology in Ithaca, New York. 
They are also an entomological rarity. Of 
the thousands of cicada species known around 
the world, only the seven Magicicada species, 
which live in the eastern and central United 
States, have evolved such prolonged and syn- 
chronized life cycles (see ‘A frenzied horde’). 
At the southern end of their range, Magici- 
cada populations have split into three mixed- 
species broods that emerge every 13 years. In 
the northern region, 12 broods follow 17-year 
cycles. This year’s crop belongs to Brood II, 
one of the larger 17-year groups and one that 
emerges in major metropolitan regions on the 
east coast. 
Since their last outing in 1996, Brood II 
cicadas have grown through five larval stages 
underground, where they survived by suck- 
ing fluids from tree roots. With the warm 
weather this month, the nymphs have been 
crawling out of the ground before moulting 
for one last time and taking wing. Reaching 
densities of up to 350 individuals per square 
metre in woodlands, the cicadas can chorus 
at more than 95 decibels — loud enough to 
harm human hearing — as the males woo the 
females. After mating, the females cut slits 
into tree branches and lay their eggs there. By 
the time the eggs hatch, in six to ten weeks, the 
ground will be littered with the bodies of their 
parents. The new generations of nymphs will 
fall to the ground, burrow into the soil and 
remain there until 2030. 
Biologists generally agree that the giant 
synchronized emergence of periodical 
cicadas overwhelms potential predators, 
allowing some of the relatively defenceless 
insects to reproduce. And some research- 
ers have proposed that the cicadas have 
evolved life cycles around prime num- 
bers of years because that arrangement 
limits the chances that predators 
will synchronize with the 
cicadas. But these ideas 

do not address > 
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> why the generations specifically last for 
13 or 17 years. 

Koenig suggests that the answer may involve 
interactions with birds. He and Andrew Lieb- 
hold of the US Forest Service in Morgantown, 
West Virginia, analysed 45 years of data from 
the North American Breeding Bird Survey (W. 
D. Koenig and A. M. Liebhold Am. Nat. 181, 
145-149; 2013) and found that bird popula- 
tions tend to fall during the years in which 
periodical cicadas emerge. Birds feed on cica- 
das, so Koenig expected to find the opposite 
pattern. He proposes that the masses of cicadas 
trigger long-term changes in the forest that end 
up causing bird populations to crash after 13 or 
17 years. The mechanism remains a mystery, 
but Koenig notes that one factor could be the 
flood of dead cicadas, whose bodies are 10% 
nitrogen. The die-off sends a pulse of ferti- 
lizer into the forest that temporarily enhances 
plant growth but could later lead to unfavour- 
able conditions for birds. “It's a pretty weird 
hypothesis,” he admits. 

To synchronize their emergence, the 
nymphs must somehow keep track of how long 
they have been underground. Gene Kritsky, 
an entomologist at the College of Mount St. 
Joseph in Cincinnati, Ohio, says that nymphs 
seem to count the number of times that trees 
set their leaves in the spring; in 2007, some 
Brood XIV cicadas emerged a year early, fol- 
lowing a strong winter thaw during which trees 
produced leaves, then dropped them and grew 
new ones in the subsequent spring. But no one 
knows how cicadas ‘remember’ the number of 
years since they last emerged. 

Researchers are making more progress in 


A cicada moults for the last time before taking wing. 


probing the biological mechanisms that allow 
cicadas to switch their life cycles. In an analysis 
of DNA markers published this year (T. Sota 
etal. Proc. Natl Acad. Sci. USA 110, 6919-6924; 
2013), a team including Cooley developed an 
evolutionary tree for Magicicada and found that 
the major species groups had repeatedly split 
into 13-year and 17-year cohorts. The research- 
ers suggest that those splits are explained by a 
common genetic mechanism across the species. 

Chris Simon, a co-author and evolution- 
ary biologist at the University of Connecticut, 


plans to follow up those results with several 
genetic studies, including sequencing the RNA 
transcripts of genes that are active at different 
stages in the cicada life cycle. She is particularly 
interested in probing the occasional tendency 
of periodical cicadas to emerge 4 years early or 
late. These ‘stragglers’ are easy prey so do not 
usually survive, but Simon and others suggest 
that the timing mistakes might have given rise 
to new broods in the past. “It’s a way of having 
instant speciation,” she says. “This ability to 
jump through time is something that has not 
been seen before in other organisms.” 

One example of this time travel might be 
happening right now in north-central Cin- 
cinnati, at least 500 kilometres from Brood IT's 
range. Kritsky documented thousands of 
cicadas appearing last week in a spot where he 
saw stragglers in 2000, four years before the 
city was inundated with the expected 17-year 
cicadas of Brood X. 

The arrival of cicadas in the same place 
this year might mean that an environmental 
change such as global warming is causing them 
to emerge early, or that a genetic factor has 
caused some members of the 17-year Brood X 
to switch to a 13-year life cycle, says Kritsky. He 
must wait another 4 years to see whether some 
of the stragglers from 2000 have reverted back 
to their 17-year schedule. The main pulse of 
Brood X will arrive in 2021. 

By then, Kritsky will be 68 years old. The 
long generations of the periodical cicadas 
makes studying them difficult, he says. “You 
would think we'd have a lot of answers but 
we don't. Very few researchers have seen five 
generations.” = 
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Tepid showing for 
enomics X prize 


Challenge may be too hard and commercially unnecessary. 


BY ERIKA CHECK HAYDEN 


but neither was it meant to be a flop. Yet as 

the 31 May registration deadline looms for 
the Archon Genomics X Prize — a challenge to 
sequence 100 complete human genomes in 30 
days at unparallelled accuracy and low cost — 
only two teams have entered. 

The lacklustre showing is a testament to both 
the difficulty of the challenge and the matura- 
tion of the DNA-sequencing industry in the 
seven years since the prize was first conceived, 


[: was never meant to be a piece of cake — 
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genetics and innovation researchers say. 

“The business has become bigger than the 
prize,” says Jonathan Rothberg, founder of the 
sequencing company Ion Torrent in Guilford, 
Connecticut, which was acquired in 2010 by 
Life Technologies in Carlsbad, California — 
which was, in turn, recently snapped up for 
US$13.6 billion by Thermo Fisher in Waltham, 
Massachusetts. Ion Torrent plans to compete, 
but other firms have apparently decided that 
they have little to gain. 

Yet the goal of the prize — to drive down 
the cost of sequencing while improving 
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its quality — matters just as much as it did 
in October 2006, when the X Prize Foun- 
dation, based in Playa Vista, California, 
first announced the challenge, experts say. 
Although sequencing costs have fallen drasti- 
cally (see ‘Plummeting costs’), that decline has 
plateaued recently. 

The original rules called for teams to 
sequence 100 genomes in 10 days for less than 
$10,000 per genome. After none of the original 
eight competitors could meet the 10-day time- 
frame, the foundation spent two years revamp- 
ing the challenge. The reconceived prize, 
launched in October 2011, extended the time 
to 30 days, tightened the cost to $1,000 per 
genome and specified that 100 genomes from 
centenarians, who may harbour life-extend- 
ing genetic variants, must be sequenced (see 
L. Kedes and G. Campany Nature Genet. 43, 
1055-1058; 2011). 

The new challenge aims at what the X Prize 
Foundation calls a “medical grade genome” — 
a sequence of all the nuclear DNA to 98% 
completeness and high accuracy, allowing 
only one error per million bases. To win the 
$10-million prize pot, teams must also find 


SOURCE: NHGRI 
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OTHER CHALLENGES 


Popular bioinformatics 


The X prize isn’t the last word on genomics 
competitions. Cheaper to solve than 
technology-based contests, bioinformatics 
challenges have proved hugely popular. 

The Critical Assessment of Genome 
Interpretation experiment has seen the 
number of entrants rise every year since It 
was first held in 2010 (see go.nature.com/ 
dfclt1). And a US$1-million purse offered by 
the US Defense Threat Reduction Agency 
has spurred thousands of researchers to try 
to identify individual organisms from mixed 
pools of sequenced DNA. 

Thousands of participants also competed 


DNA insertions, deletions and rearrangements 
within genes and determine which parent each 
one came from. 

Hitting all these goals in one go is hugely chal- 
lenging. Market leader Illumina in San Diego, 
California, boasts a rate of false-positives (inac- 
curately flagging a DNA base as a variant from 
normal) of 0.25% and a rate of false negatives 
(missing a real variant) of 7.4%, well above the 
error rates allowed by the X-prize requirements. 
Itis hard to do better because current technolo- 
gies sequence the genome in short stretches 
that then have to be reassembled, introduc- 
ing errors. New ways to sequence longer seg- 
ments in one go are commercially available (see 
Nature 494, 290-291; 2013), but they are slow 
and expensive. “To date, none of them would 
win the X prize at this scale,” says quantitative 
biologist Michael Schatz at Cold Spring Harbor 
Laboratory in New York. 

Still, why have so few teams even deigned to 
try? Meeting the challenge would cost much 
more than the prize purse, but that has also 
been true of past contests that attracted dozens 
of entrants, such as the Ansari X Prize, which 
required teams to send passengers into space, 
and the US government’s Defense Advanced 


PLUMMETING COSTS 


Advances in sequencing technologies have 
driven a sharp drop in price. 
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in a bioinformatics challenge in April to find 
hidden sequences in DNA data sets (run 
by the journal Genome Biology and curated 
by Michael Schatz of Cold Spring Harbor 
Laboratory in New York and James Taylor 
of Emory University in Atlanta, Georgia). It 
was organized to commemorate the sixtieth 
anniversary of the original research papers 
describing the structure of DNA. 

Now Grant Campany, senior director for 
the Archon Genomics X Prize, says that 
the X Prize Foundation itself is considering 
laying out another challenge aimed at 
genome interpretation. E.C.H. 


Research Projects Agency Grand Challenges, 
one of which catalysed the development of suc- 
cessful self-driving cars. 

Part of the answer is that a genomics prize, 
unlike a rocket launch, isn't easy to explain to 
the public. Asa result it does not have the same 
publicity value, says Luciano Kay, a researcher 
at the Center for Nanotechnology in Society at 
the University of California, Santa Barbara. A 
competition for a self-driving car that can go 
10 kilometres is more attractive than manipula- 
tion of matter or genes at tiny scales to accom- 
plish a very scientific or technical feat, Kay says. 

And the goal of the genomics X prize — to 
sequence whole genomes to medical grade 
rapidly and cheaply — may not bea top com- 
mercial priority at present. The business of 
genomics is already booming on the basis of 
less complete sequences, and Rothberg points 
out that scientists can only interpret the small 
fraction of the genome that codes for proteins 
(the ‘exome’). It is unclear what would be 
gained from an accurate sequence of the rest. 

Still, whoever wins the prize earns the right 
to boast, which explains why Ion Torrent 
decided to compete. It also explains why Ilu- 
mina decided not to: failure would only dent 
its reputation, muses Timothy Harris, who 
develops applied-science tools at the Howard 
Hughes Medical Institute's Janelia Farms cam- 
pus in Ashburn, Virginia. The other entrant 
is a team led by George Church at the Wyss 
Institute at Harvard in Boston, Massachusetts. 

Grant Campany, senior director of the 
genomics X prize, hopes that other teams will 
step up to compete before the contest gets 
under way in September. Even if they dort, 
scientists predict that its goals will be achieved 
within the next few years, whether through the 
prize or not. “If you could deliver that kind of 
performance you would have the commer- 
cial advantage by a large margin over anyone 
else,’ Harris says. “That commercial advan- 
tage is worth way more than the X prize.” mSEE 
EDITORIAL P. 535 
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SCIENCE POLICY 


Japan aims high for growth 


Innovation in science is at the heart of government plans to boost the economy. 


BY DAVID CYRANOSKI 


he Japanese government is working on 
T: plan to revitalize its science workforce 

by boosting opportunities for female 
scientists, attracting top talent from abroad and 
increasing the commercialization of research. 
So what else is new? Over the past decade, suc- 
cessive administrations have had similar goals, 
but little progress has been made. This time, 
analysts and scientists think that things might 
be different. 

Prime Minister Shinzo Abe is riding high 
since his Liberal Democratic Party swept back 
to power in December. He has stimulated the 
economy, ending 15 years of deflation, jump- 
started the stock market and weakened the yen 
to spur exports. His reform platform includes a 
new growth strategy, and central to that strat- 
egy is innovation in science and technology. 

Abe’ cabinet has already committed hundreds 
of billions of yen to space, physics and stem- 
cell research, in a stimulus package announced 
in January. But more aggressive measures are 
yet to come. On 17 May, the Council for Science 
and Technology Policy — the nation’s leading 
science body, which Abe chairs — released 
the first draft of a Comprehensive Science, 
Technology and Innovation Strategy, calling for 
even more fundamental changes. 

To be finalized by the end of this year, the 
strategy includes eye-catching targets for bol- 
stering the country’s intellectual base. It calls for 
30% of new recruits at research institutes and 
universities to be female by 2016, up from 21.2% 
in 2010. And faced with fewer university gradu- 
ates anda shrinking birth rate, Japan is looking 
to open itself up to foreigners. Under the plan, 
international researchers would occupy 20% of 
staff positions at leading research organizations 
by 2020, and 30% by 2030. That would be an 
ambitious jump from the 2010 figure of 3.9%. 

To increase technology transfer, the plan sets 
a goal of doubling the number of collaborations 
worth more than 10 million yen (US$98,000) 
between university and industry by 2030. And 
it calls for the numbers of foreign patent appli- 
cations and collaborations lasting more than 
3 years to similarly double. 

Japan needs to make up ground. It has lost 
the competitive edge it once had over China, 
South Korea and other Asian rivals in indus- 
tries such as microelectronics and pharma- 
ceuticals. “Over the past decade, Japan has 
been stagnant in terms of innovation,” says 
Yuko Ito, head of the Science, Technology and 
Innovation Policy division at the Tokyo-based 


548 | NATURE | VOL 497 | 30 MAY 2013 


Japan’s Prime Minister Shinzo Abe visits a 
lab in Fukuoka. 


National Institute of Science and Technology 
Policy. 

Abe failed to reverse those trends in 2006, 
during his first term as prime minister, with a 
largely ineffectual Innovation 25’ programme. 
Nonetheless, some see reasons for optimism, 
not only about economic success but also for 
a more innovative workforce. The strategy this 
time “is emphasizing the need for cultivating 
human resources, especially women’, says Ito. 

Hisako Ohtsubo, a molecular biologist 
at Nihon University near Funabashi who 
researches gender equality in science, is also cau- 
tiously optimistic. She says that it was a pleasant 
surprise to hear Abe — a conservative who, for 
example, opposes passage of the imperial throne 
to the female blood line — repeatedly mention 
the importance of increasing women’s role in 
the business world. Most startling for Ohtsubo 
was Abe’s statement that childcare across Japan 
should be expanded, something many analysts 
have said is crucial for women’s ability to have 
careers in research and other sectors. “He’s a 
different phenotype; she says. “Before he would 
never have talked about such things.” 

Still, she is waiting to see whether Abe will fol- 
low up his words with the necessary investment 
in programmes to give extra grants to women 
and encourage companies and universities to 
hire women to senior researcher positions, and 
not just as regular staff. “That's the only way we 
will be able to overcome the deeply embedded 
stereotypes in the system,’ Ohtsubo says. 

Piero Carninci, a genomics expert based in 
Yokahama at the RIKEN Center for Life Science 
Technologies with nearly two decades’ experi- 
ence in Japan, has advice for Abe if he is serious 
about courting foreigners. As head of RIKEN’s 
new Division of Genomic Technologies, he 
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is the first foreigner to hold a division direc- 
tor position. The top-down management style 
of mentors, communication problems in the 
laboratory and the Japanese lifestyle can all be 
daunting for people from abroad, he says. 

Carninci’s remedy includes giving foreign 
researchers academic independence and rea- 
sonable start-up budgets, as well as assistance 
in overcoming language problems. Also key: 
offering an equal opportunity with Japanese 
nationals to climb the career ladder. 

Atsushi Sunami, a science-policy expert at 
the National Graduate Institute for Policy Stud- 
ies in Tokyo, says that efforts to hire overseas 
talent may also benefit from one of the themes 
of the government's growth strategy: deregu- 
lation of markets and quasi-governmental 
organizations, including universities. 

Decisions on the main budget items still 
have to be made, Sunami says. A proposal in 
the comprehensive strategy would give an 
annual budget — some 50 billion yen, accord- 
ing to Sunami — to the Council for Science and 
Technology Policy to fund science. Currently, 
the council is only an advisory body. But until 
the budget decisions are made, Sunami says, the 
strategy is “just a piece of paper”. 

Koichi Sumikura of the National Institute of 
Science and Technology Policy thinks that the 
Fukushima nuclear disaster, and the ensuing 
energy shortages, should help to keep innova- 
tion on the agenda. “The disaster is what really 
makes things different this time,” says Sumi- 
kura. “Before, politicians talked about innova- 
tion without indicating any real direction. Now 
there is a clear need to recover from the disaster 
and build a stronger society.” m SEE EDITORIAL P.535 


CORRECTIONS 

The world map in ‘GM crops: A story in 
numbers’ (Nature 497, 22-23; 2013) 
wrongly highlighted Slovenia instead of 
Slovakia as one of Europe’s GM-crop- 
producing nations. The table in the News 
story ‘Log-jam in agency confirmations’ 
(Nature 497, 418-419; 2013) omitted 
Christine Todd Whitman’s full name. In the 
News story ‘Graphene knock-offs probe 
ultrafast electronics’ (Nature 497, 422-423; 
2013), reference 2 should have cited 
volume 438, not 483. And the News Feature 
‘Outward bound’ (Nature 497, 424-427; 
2013) wrongly stated that Voyager 1 and 
Voyager 2 were renamed en route to Jupiter. 
In fact, they were renamed before launch. 
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Method man 


Karl Deisseroth is leaving his mark on brain 
science one technique at a time. 


BY KERRI SMITH 


en Karl Deisseroth moved into his 
first lab in 2004, he found himself 
replacing a high-profile tenant: 


Nobel-prizewinning physicist Steven Chu. “His 
name was still on the door when I moved in,” 
says Deisseroth, a neuroscientist, of the base- 
ment space at Stanford University in California. 
The legacy has had its benefits. When chem- 
istry student Feng Zhang dropped by looking 
for Chu, Deisseroth convinced him to stick 
around. “I don't think he knew who I was. But 
he got interested enough” 

Deisseroth is now a major name in science 
himself. He is associated with two blockbuster 
techniques that allow researchers to show how 
intricate circuits in the brain create patterns of 
behaviour. The development of the methods, he 
says, came from a desire to understand mecha- 
nisms that give rise to psychiatric disease — and 


2013 


from the paucity of techniques to do so. “It was 
extremely clear that for fundamental advances 
in these domains I would have to spend time 
developing new tools,’ says Deisseroth. 

His measured tone and laid-back demean- 
our belie the frenzy that his lab’s techniques 
are generating in neuroscience. First came 
optogenetics', which involves inserting light- 
sensitive proteins from algae into neurons, 
allowing researchers to switch the cells on 
and off with light. Deisseroth developed the 
method shortly after starting his lab, working 
with Zhang and Edward Boyden, a close col- 
laborator at the time. Optogenetics has since 
been adopted by scientists around the world to 
explore everything from the functions of neu- 
ron subtypes to the circuits altered in depres- 
sion or autism. Deisseroth has lost count of 
how many groups are using it. “We sent clones 
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to thousands of laboratories,’ he says. 

Now his lab is gearing up for another rush, 
after publishing a method called CLARITY’. 
The technique uses a chemical treatment 
to turn whole brains transparent, allowing 
researchers to examine the brain’s structure 
in detail in three dimensions. It could help 
researchers to connect the brain’s function 
with its structure. “I think it ll be de rigueur 
for people to have a picture showing the 
pathways and how they sit in the brain,’ 
says Richard Tsien, the neuroanatomist 
under whom Deisseroth earned his PhD 
at Stanford. 

Deisseroth takes risks and encourages 
his team to do the same, often pulling 
together people with varying expertise 
and keeping abreast of many different 
fields. Christof Koch, chief scientific officer at 
the Allen Institute for Brain Science in Seattle, 
Washington, puts Deisseroth’s success down 
to these “catholic interests”. But Deisseroth 
also has a laser-like focus on tool-building, 
fuelled, he says, by his work as a practising 
psychiatrist. And although he is excited about 
his lab’s newest technique, Deisseroth plays it 
down, shrugging a broad shoulder: “I’m not 
very demonstrative,” he says. 


TAKING CONTROL 

When Tsien, who now works at New York 
University’s Langone Medical Center, first met 
Deisseroth, he was struck by his determina- 
tion. “It was an unusual encounter — he came 
for an interview and told me rather shyly that 
the reason he came to Stanford was because he 
wanted to work with me,’ Tsien remembers. 
It was 1993 and Tsien’s lab was “crammed to 
the gills’, he says, but Deisseroth was persistent 
and persuasive. As part of a joint programme 
to earn a PhD and a medical doctorate, he 
began a project looking at calcium channels 
in neurons. In 2000, he followed that up with 
a multi-pronged appointment: a postdoctoral 
fellowship in the lab of Stanford neuroscientist 
Robert Malenka and — after toying with the 
idea of neurosurgery — a residency in psychia- 
try at Stanford Medical School. 

“Everything changed when I did my psychia- 
try rotation,” says Deisseroth. “A person can be 
right in front of you who looks intact, not obvi- 
ously injured, and yet their brain is constructing 
for them a completely different reality. At the 
same time I saw how deep the suffering was.” 

Studying depression or anxiety in a dish of 
cells was inadequate, he reasoned, because 
only whole brains can give rise to the sophis- 
ticated functions — and disorders — that 
characterize human behaviour. And tech- 
niques for studying whole brains in humans 
and model organisms were often limited to 
simply watching them at work. 

So Deisseroth began thinking about ways 
to examine and control intact systems. “I was 
having a lot of discussions with a lot of people,” 
he says. During his residency, Deisseroth met 


Boyden, a PhD student with similarly ambi- 
tious aims. The two began talking about ways 
to manipulate individual neurons as a side pro- 
ject. “It was a very adventurous collaboration, 
full of exploration,” says Boyden. 

One idea involved using light to control 
neuronal firing. Boyden and Deisseroth knew 
about light-sensitive channel proteins called 
opsins, which algae use for generating energy, 


“It was a little hard 


for people in the field 
to understand what he 


was trying to do.” 


among other functions. Several groups — 
including that of Tsien’s brother Roger at the 
University of California, San Diego — were 
trying to insert these proteins into neuron cell 
membranes. The project needed “the where- 
withal to spend the money and find the gradu- 
ate students’, says Deisseroth. In 2004, having 
secured his own lab, he could do just that. 

By July of that year, Deisseroth had man- 
aged to coax neurons into expressing opsins in 
their cellular membranes. Using cells borrowed 
from Malenka and recording equipment from 
Richard Tsien, he, Zhang and Boyden hun- 
kered down to see if the technique would fly. 
“Tt worked on pretty much the first try,’ Boyden 
recalls. “It was like riding a wave of serendipity: 

In the following year, Deisseroth finally 
secured funding from the US National Insti- 
tutes of Health (NIH) to further the work. He 
had faced multiple rejections. Projects based 
on technologies have a harder time attract- 
ing support than hypothesis-driven projects, 
says Thomas Insel, director of the US National 
Institute for Mental Health in Bethesda, Mary- 
land, the source of the initial federal funding 
for Deisseroth’s optogenetics research. “It was a 
little hard for people in the field to understand 
what he was trying to do,” says Insel. 

But neuroscientists saw the potential in 
2005, when Deisseroth’s group published its 
first big paper showing that the technique 
worked in a dish’. Researchers could now stim- 
ulate a specific type of neuron and see how that 
affected the cell’s — or even an entire animal's 
— behaviour. Requests for the technology were 
pouring in. 

The technique has since been used for every- 
thing from studying the development of neural 
stem cells’ to prompting mice to recall fearful 
memories’ (see Nature 465, 26-28; 2010). Deis- 
seroth and his team, with their focus on psy- 
chiatric conditions, have used rodent models 

to explore the network 


> NATURE.COM of brain areas that gives 
For interviews with rise to anxiety and have 
Deisseroth anda shown that one ‘hub’ 
video of CLARITY, see: controls diverse symp- 
go.nature.com/zsxdld toms such as an elevated 


© 2013 Macmillan Publishers Limited. All rights reserved 


FEATURE | NEWS 


breathing rate and feelings of panic and discom- 
fort’. They have switched on and offthe activity 
of mouse neurons that use the neurotransmitter 
dopamine, to show how they contribute to the 
symptoms of depression®. The team has even 
used optogenetics to prevent cocaine-addicted 
rats from seeking out the drug’. The work, says 
Deisseroth, could help scientists to design anti- 
anxiety drugs that are less addictive than current 
treatments — affecting anxiety pathways 
but leaving reward circuitry untouched. 

In 2006, Boyden started his own lab at 
the Massachusetts Institute of Technol- 
ogy in Cambridge. Rumours emerged 
that his relationship with Deisseroth 
was souring. Neither is keen to discuss 
the details, but in 2007, the two groups 
published separate papers on halo- 
rhodopsin, an ‘off-switch’ opsin that they had 
previously studied together*”. “We didn’t know 
Ed was working on that until we sent our paper 
to Nature,’ says Zhang. “Well you know, in any 
competitive scenario people are interested in 
being first,” says Boyden. 

The experience might have left its mark 
on Deisseroth: he is careful to attribute credit 
precisely to his team members and trainees, 
particularly with respect to CLARITY. “Karl 
is quite sensitive about who came up with the 
idea,” says Kwanghun Chung, first author of 
the CLARITY paper’. Creating see-through 
brains was Deisseroth’s dream, but the chemi- 
cal know-how came from Chung. They 
describe the finding as a co-discovery. 


SEEING CLEARLY 

Standard techniques for examining fine-scale 
brain structure involve slicing the brain into 
tissue-thin segments, analysing them under 
a microscope, and then — laboriously and 
often imprecisely — stacking the images back 
together. To bypass that process, Deisseroth 
had been looking for a means of chemically 
treating the brain to make it transparent. Some 
components of brain cells — especially lipids 
— are notoriously opaque to microscopes, so 
the idea was to get rid of them, while preserving 
the neuronal structure. It was a very different 
approach from optogenetics, and recruiting 
Chung — a chemical engineer by background 
— displayed Deisseroth’s willingness to take 
risks. “I remember people wondering what I 
was doing, what sort of plan there was in hiring 
this person and the direction the lab was going, 
because it was such a marked shift,” he says. 

At first, Deisseroth wanted to immobilize 
neurons by filling them with a suitable mate- 
rial and removing the surrounding tissue. It 
turned out to be very difficult to build a stable 
structure from inside the neurons, but it was 
easier to build one around them. 

The team tried a few scaffolding materials, 
including keratin and a cellulose-like struc- 
ture, before settling on a hydrogel — a gel made 
mainly of water and already widely used in bio- 
logical studies. The researchers found that a 
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A light touch 


Karl Deisseroth’s lab has produced two blockbuster techniques 
that allow researchers to visualize and manipulate the brain. 


Optogenetics 


Viruses are used to ferry genes encoding 
light-sensitive receptors (opsins) into 
specific neurons. 


Animals are fitted with an ‘optrode’, a 
fibre-optic cable with an electrode. 


Light beamed down the optrode will open or close 
ion channels while the electrode records neuronal 
firing and researchers record behaviours. 


hydrogel scaffold could bind components of 
neurons in place, including proteins, neuro- 
chemicals and even DNA and RNA (see ‘A 
light touch’). With the structure locked down, 
a detergent called SDS could wash away lipid 
membranes, leaving the tissue transparent (see 
Nature 496, 151; 2013). 

Within a week of publication, Deisseroth 
had already received several dozen requests for 
information. “We've been very much... I’m try- 
ing to find a verb that doesn’t make it seem like 
a problem,’ he says. He settles on describing the 
e-mails that the lab received as an “outpouring”. 

CLARITY stunned people in the same way 
that optogenetics did, says Insel. “Usually when 
someone has at an early stage of their career 
made an important advance, they tend to rest 
on those laurels. Karl quite quickly decided to 
do the next big thing” 

Other researchers will tailor CLARITY to 
their own projects. David Van Essen, a neuro- 
scientist at Washington University in St. Louis, 
Missouri, is interested in testing CLARITY in 
the brain’s white matter, the bundles of neu- 
ronal projections called axons that carry nerve 
impulses. This will help his group study pat- 
terns of connectivity that link brain regions. 

Using CLARITY with techniques such as 
optogenetics, or using it to analyse brains after 
behavioural studies, will be a powerful way to 
extract information about how brain networks 
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Proteins and other molecules in the brain 
are hybridized to a hydrogel matrix, fixing 
them in place. 


Densely packed, light-blocking lipids are 
washed away using detergents. 
The clarified brain can be stained and washed 


multiple times, allowing scientists to observe 
different molecules and cell types. 


function, says Eve Marder, a neuroscientist at 
Brandeis University in Waltham, Massachu- 
setts. But analysing large networks in detail is 
challenging. Marder usually works with simple 
circuits of 30 or so neurons, and says that even 
those have a scarily exponential number of per- 
mutations. “My only personal hope is that peo- 
ple who venture into doing circuit work in larger 
brains pay attention to the lessons learned and 
mistakes that we made along the way,’ she says. 
CLARITY is already shedding light on clini- 
cal disorders. In the paper describing the tech- 
nique’, the researchers analysed brain tissue 
from a seven-year-old boy who had autism 
spectrum disorder. They found that neurons 
in his cortex had joined together in ladder- 
like patterns, rather than the branches seen in 
typical brains. Animal models of autism-like 
conditions had hinted at this difference, but 
CLARITY made it possible to look for the 
irregularity in human brain samples. 


RISK AND REWARD 

Deisseroth’s office and much of his lab are still 
in the same basement that he moved into in 
2004, although the team — now including 35 
people — has branched out into lab space else- 
where. The labs are strewn with gadgetry, like 
the bedroom of a technology-obsessed teen- 
ager. There is no natural light in the basement, 
but Deisseroth likes it; being at the bottom of 
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the building keeps vibrations to a minimum, 
which is necessary when using the sophisticated 
microscopes essential for his work. 

The current team includes computational 
neuroscientists, medics, chemists and engi- 
neers. Anda US$22.5-million Transformative 
Research Award from the NIH, granted last 
year, allows Deisseroth to keep encouraging 
them to take chances. “I have everyone have a 
hand in a high-risk and a low-risk project,’ he 
says, “like a hedged portfolio.” 

One of the latest high-risk projects uses 
light-field microscopy, which records images 
from a variety of perspectives simultane- 
ously, to image the brain. Elsewhere in the lab, 
Chung and collaborator Viviana Gradinaru are 
still trying to build hydrogel structures inside 
neurons in the hope of preserving networks of 
specific cell types — freezing in place all the 
neurons of a certain type, for example, or all 
the cells that express a particular gene. The 
team continues to improve optogenetics, too, 
developing new opsins and perfecting meth- 
ods for delivering light to the brain. 

Deisseroth has already added space to the 
laboratory to cater for all the visiting scien- 
tists coming to learn how to use optogenetics 
and CLARITY. There were more than 200 in 
2010-12. The group has “done yeoman’s work 
in making this relatively straightforward to set 
up’, says Van Essen. “They're not going to hold 
hostage a methodology that people really can 
set up in their own lab.” The tools are freely 
available; Stanford has filed patents on some 
aspects, says Deisseroth, but that it is to ensure 
unfettered access to the technology. He does 
not make money from the patents. 

Deisseroth struggles with his decision to 
spend more time on research than with the 
patients who inspire it. He still runs a weekly 
psychiatry clinic, but has to balance this with a 
packed travel schedule and mentoring duties. 
He carries on, however, hopeful that his studies 
will ultimately benefit those that medicine can- 
not. “We can help patients right now with many 
psychiatric symptoms, he says. “But for others 
we cant. So that really helps me in mentally free- 
ing up the time needed for research” 

That desire to help is buoyed by simple, 
boundless curiosity. “T think all scientists have 
alittle bit in them,” he says. “That wonder and 
curiosity about the natural world — not only 
how it is, but how it could be.’ m 


Kerri Smith is senior audio editor for Nature 
in London. 
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* The Hubble Ultra-Deep Field shows malapee 

. as they were when the Universe was young. 
Those with the highest estimated redshifts 
(numbers and insets) were born more than 

13 billion years ago, soon after the Big Bang. 


\ ’ 


The Hubble Space Telescope is giving astronomers a glimpse 
of the Universe’s first, tumultuous era of galaxy formation. 


or one sleepless week in early September 
2009, Garth Illingworth and his team had 
the early Universe all to themselves. At NASA’s request, 
Illingworth, Rychard Bouwens and Pascal Oesch had just 
spent the previous week staring into their computer screens 
at the University of California, Santa Cruz, scanning through hundreds 
of black-and-white portraits of faint galaxies recorded in a multi-day 
time exposure by a newly installed infrared camera on the Hubble Space 
Telescope. NASA simply wanted the three astronomers to preview the 
images and make sure that the camera was working correctly, before the 
agency released the data more widely. 
But Illingworth, Bouwens and Oesch were hoping that they would 
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find more — that at least some of those smudges of 
light would prove to be among the first galaxies to 
form in the Universe, less than 1 billion years after the Big Bang. Evena 
faint glimpse of such objects could provide fresh insights into some of 
the biggest questions in cosmology, ranging from the nature of the first 
stars to the tumultuous beginnings of galaxy formation. 

That week, the astronomers began to focus on two dozen tiny candi- 
date images — each so dim and grainy that they might easily be noise in 
the camera's digital sensors. But as their analysis proceeded, it became 
clear that these patches of light had the right colour, appearing only in 
the camera's reddest filters — exactly what would be expected of new- 
born galaxies seen at a very great distance and very high redshift. And 
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when the three colleagues started digitally adding together exposures 
of each candidate, says Illingworth, “suddenly there they were” — fuzzy, 
but undeniable images of galaxies. “That week in September was one of 
the most exciting times of my career!” 

By the week’s end, he, Bouwens and Oesch had posted two draft papers 
to the arXiv preprint server’, detailing their first-ever collection of more 
than 20 galaxies from the age of galaxy formation, some 13 billion years 
ago, when the cosmos was only 600 million to 800 million years old. Since 
then, other researchers have made further observations of the same small 
patch of sky, known as the Hubble Ultra-Deep Field (HUDF), and four 
other larger regions. They have expanded that initial roster to some 1,400 
young galaxies, from the same era. 

The data from this growing catalogue are already hinting at a still- 
unseen time — an infant Universe thronged with countless small gal- 
axies and lit by primordial stars so massive that they burned out and 
blew up in a cosmic eye-blink. And a new generation of instruments 
promises to bring that era into clear view. They include the Atacama 
Large Millimeter/submillimeter Array (ALMA) of radio telescopes in 
Chile, which is already beginning such observations; and Hubble’s suc- 
cessor, the infrared James Webb Space Telescope (JWST), which is set 
for launch in late 2018. 

It's a heady time for early- Universe astronomers, says cosmologist 
Avi Loeb of Harvard University in Cambridge, Massachusetts. “We're 
looking at our origins,” he says. “The first galaxies were the building 
blocks of the Milky Way, and the desire to understand them is a search 
for our roots.” 


DEEP BACKGROUND 

Over the past few decades, observers have developed a general storyline 
describing how galaxies formed (see ‘Dawn’ early light’). Astronomers 
know, for example, that the raw material was a hot, ionized plasma of 
hydrogen and helium that emerged from the Big Bang, then rapidly 
cooled as the Universe expanded. Once its temperature had fallen far 
enough, about 370,000 years after the Big Bang, protons and electrons 
combined to make neutral atoms and created a light-absorbing haze that 
plunged the Universe into a cosmic ‘dark ages’ 

Astronomers also know that this cosmic haze was almost perfectly 
uniform at the start — but immediately began to clump together as grav- 
ity began to magnify slight fluctuations in the 
material’s density. And they are reasonably 
sure that, after several hundred million years, 
the densest of the growing clumps began to 
form the first stars, which ignited by thermo- 
nuclear fusion and reionized the neutral gas 
that remained. The veil of gas became a trans- 
parent plasma again, bringing the cosmic dark 
ages to a spectacular end (see Nature 490, 
24-27; 2012). 

But from this point onwards, very little is 
certain. The formation of succeeding genera- 
tions of stars and galaxies was a swirling chaos of heating and cooling gas 
clouds, detonating supernovae, black-hole accretion and fierce stellar 
winds strong enough to eject matter from small galaxies — a process far 
too messy and complex to understand without extensive observations. 

Such observations are a major goal of the HUDF project, which 
aims to gather enough images of distant galaxies to discern patterns 
in their sizes, shapes and colours. Located south of Orion in the con- 
stellation Fornax, and measuring just one-tenth of the diameter of the 
full Moon as seen from the ground, the HUDF is an otherwise typical 
patch of dark sky that happens to be relatively devoid of foreground stars 

and galaxies. But just as astronomers expected, 


> NATURE.COM an 11.3-day time exposure of the field taken by 
To watcha Hubble in late 2003 and early 2004 revealed that 
simulation of it was, in fact, filled with a multitude of faraway 
galaxies forming see: galaxies seen as they were billions of years ago. 

go.nature.com/Izidpf In August and September 2009, the field was 


“SUDDENLY THERE 
THEY WERE — FUZZY, 
BUT UNDENIABLE 
IMAGES OF GALAXIES.” 
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re-examined in an additional two-day exposure taken by Hubble’s Wide 
Field Camera 3 (WFC3): an instrument installed by astronauts the pre- 
vious May that is exquisitely sensitive at infrared wavelengths — exactly 
where visible and ultraviolet light from the farthest galaxies is expected 
to end up after being redshifted by the cosmic expansion. 

These were the images that Illingworth, Bouwens and Oesch saw. 
Knowing that the WFC3 could detect distant galaxies about 30 times 
fainter than its predecessor could, or about 4 billion times fainter than 
anything visible to the human eye, the astronomers initially thought 
that they might have caught one of the very first generations of galaxies 
in the act of being born. When they estimated the objects’ distance and 
composition by examining their colours in three different filters — the 
faint smudges were far too dim for Hubble to get a spectrum — the team 
found that they were relatively blue, exactly as expected of extremely 
young galaxies glimpsed in their first frenzy of star formation. 

But this conclusion was far from ironclad. Testing the idea was a 
prime motivation for a team of astronomers led by Richard Ellis at the 
California Institute of Technology in Pasadena. In 2012 they re-exam- 
ined a small part of the centre of the HUDE this time with an additional 
colour filter and a time exposure totalling about 23 days. 

These newer observations, which Ellis’s team reported in January this 
year at a meeting of the American Astronomical Society in Long Beach, 
California*”, reveal that the galaxies are in fact redder, and therefore 
contain older stars, than initially calculated. The very youngest galax- 
ies that Hubble has identified, imaged as they appeared 560 million to 
780 million years after the Big Bang, contain stars that are 100 million 
to 200 million years old. So these galaxies had already been around for 
at least that long. 

The new HUDF observations also reveal puzzling features of the 
tumultuous era of reionization, as explained at the January meeting 
by Brant Robertson of the University of Arizona in Tucson. This was 
the time when the first galaxies were growing bigger and more numer- 
ous, and when ultraviolet light from the first stars was becoming strong 
enough to ionize the veil of thick hydrogen gas that enveloped them. 
Other observations show that reionization began roughly 250 million 
years after the Big Bang, and that it was complete at a cosmic age of 
roughly 1 billion years — at which point starlight could stream freely 
into space and the cosmos was mostly transparent, just as we see it today. 

But although the galaxies Hubble saw in 
the 2012 (and the 2009) HUDF observations 
were presumably the largest and brightest 
ones around all those billions of years ago, 
there simply were not enough of them to 
reionize the Universe. This means, according 
to Ellis, Robertson and their colleagues, that 
there must have been a large population of 
unseen small fry that did most of the work — 
a conclusion also reached by Illingworth and 
his team’. 

“We now know there’s a whole popula- 
tion of small galaxies at even earlier times” than Hubble's detectors can 
record, says Ellis — which leads to an exciting set of questions for the 
newer telescopes such as ALMA and JWST, including how these bodies 
formed and how they coalesced into the larger galaxies that came later. 

Another set of questions relates to the very first generation of stars, 
which coalesced from almost pure helium and hydrogen forged in the 
Big Bang. Theory suggests that they were more than 100 times as mas- 
sive as our Sun — far larger than any stars that form today. If so, then 
theory also suggests that these monsters were so short-lived that none 
of them would have survived in the galaxies that Hubble can see. Their 
extreme size would have caused these stars to destroy themselves in 
spectacular supernova explosions after only some 2 million years. But 
did they? And did their death throes delay the birth of the next genera- 
tion of stars, by disrupting the thick interstellar gas clouds in which new 
stars were forming? 

The HUDF data already suggest that the answer to the last question 
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DAWN'S EARLY LIGHT 


The Big Bang generated a haze of primordial hydrogen and helium. 
After about 370,000 years, cosmic expansion cooled it eno 
electrons and protons to form neutral atoms (‘recombination’). 
During the resulting ‘dark ages’, when the Universe had no stars to 
light it, gravity magnified slight fluctuations in density and caused 
the neutral gas to clump. The densest clumps eventually became 
the first stars and galaxies. Their ultraviolet light ionized the 
remaining neutral gas (‘reionization’), allowed light to stream 

freely into space, and brought the dark ages to an end. 


Billions of years ago 


Neutral 
hydrogen 
and 
helium 


Cosmic 
‘dark ages’ 


~A 
First stars 


Recombination 


is ‘No, says theorist Volker Bromm at the University of Texas at Austin, 
who was not involved in the 2009 and 2012 HUDF studies. Because the 
colours of the galaxies seen in the ultra-deep field indicate that they had 
been forming stars for at least 100 million years already, this suggests 
that there was little or no lag between the death of the very first genera- 
tion of stars and the birth of the second, he says. The generations may 
even have overlapped. But untangling exactly what happened will be a 
job for future telescopes. 


THE NEXT FRONTIER 

In the meantime, Hubble astronomers haven't been idle. NASA is pursu- 
ing a tactic that could turn the observatory into a telescope as powerful 
as the JWST will be, along some limited fields of view. 

To achieve this, astronomers are scanning the heavens to select six 
fields of view that, unlike the HUDE each contains a high-mass cluster 
of foreground galaxies. As first predicted by Einstein, such clusters act as 
cosmic zoom lenses, gravitationally magnifying and brightening images 
of distant galaxies that lie directly behind them. 

Hubble's visible-light and infrared cameras will take turns at look- 
ing through these lenses, which should reveal distant galaxies 10 to 50 
times fainter than any previously known — among them the multitude 
of small fry whose existence is indicated by the reionization data. Col- 
lection of data from the first four ‘frontier fields’ is scheduled to be 
completed over the next two years, says Mark Dickinson of the National 
Optical Astronomy Observatory in Tucson, Arizona. 

In Chile, ALMA will join the hunt for distant galaxies from this summer 
(see Nature 495, 156-159; 2013). In contrast to Hubble, which records 
starlight, ALMA’s microwave measurements will reveal the gas and dust 
that gives rise to stars in these remote bodies. Paradoxically, says James 
Dunlop of the University of Edinburgh, UK, a member of the 2012 HUDF 
team, this will allow ALMA to make the most accurate measurement yet 
of starbirth at such distances. Newborn stars radiate most of their light 
at ultraviolet wavelengths, he explains, but much of that light is absorbed 
by gas and dust and reradiated at infrared wavelengths, which are then 
redshifted by cosmic expansion into ALMAs millimetre range. 

In addition, ALMA’ high spatial resolution will enable the array to 
break radio emissions into their component wavelengths and therefore 
record the actual redshifts — bona fide measurements of distance — of 
many of the remote galaxies that Hubble has studied, says Chris Carilli 
at the National Radio Astronomy Observatory in Socorro, New Mexico. 


556 | NATURE | VOL 497 | 30 MAY 2013 


13.4 


HUBBLE 
2012 


v 


-——_—_— Reionization ———__ 4 


* 


a~ 
First galaxies 


Astronomers can then translate those distance measurements into ages, 
which will give them a much better handle on where these objects fit 
in cosmic history. 

“Hubble has been amazing at finding candidate galaxies from red- 
shift 7 to 10, but none of these has been confirmed with spectra and the 
potential for [spurious candidates] is severe,” says Carilli. 

Carilli and his collaborators reported’ in February that ALMA can 
measure redshift-7 galaxies (objects 3,955 megaparsecs or 12.9 billion 
light years from Earth) using just 20 ofits eventual complement of 66 
antennas. A report from another team in Nature® provides further evi- 
dence. ALMA will “quickly make the jump to redshift 8” by the end 
of the year, says Carilli, and if the array gets a new set of receivers — a 
possibility still several years in the future — it could study and measure 
distances to galaxies out to redshift 11. These objects would be seen 
as they appeared just 425 million years after the birth of the Universe. 
ALMA could become the “redshift machine of choice” for the first 
galaxies, he says. 

Nevertheless, most astronomers are eagerly awaiting the 6.5-metre 
JWST, whose raison d’étre is to image the faint, primitive bodies that 
Hubble can only glimpse — but that were the earliest ancestors of mod- 
ern galaxies such as the Milky Way (see Nature 467, 1028-1030; 2010). 
Hubble's observations provided “the first hints of the first galaxies”, says 
Ellis, but “we really need the JWST to push back into that even earlier 
period from 200 million years to 500 million years after the Big Bang”. 

Back in 2009, even as NASA astronomer and astronaut John Gruns- 
feld glimpsed the first images of distant galaxies from the infrared 
camera he had helped to install on Hubble, the JWST had come to his 
mind. “I couldn't help but feel awed by the power of Hubble,” Grunsfeld 
recalls, but “the views of the HUDF also gave me great satisfaction that 
the JWST would have plenty to see.” m 


Ron Cowen is a freelance writer in Silver Spring, Maryland. 


. Bouwens, R. J. et al. http://arxiv.org/abs/0909.1803 (2009). 

. Oesch, P.A. et al. http://arxiv.org/abs/0909.1806 (2009). 

. McLure, R. J. et a/. Mon. Not. R. Astron. Soc. http://dx.doi.org/10.1093/mnras/ 
stt627 (2013). 

. Robertson, B. E. et al. Astrophys. J. 768, 71 (2013). 

. Schenker, M.A. et al. Astrophys. J. 768, 196 (2013). 

. Bouwens, R. J. et al. Astrophys. J. Lett. 752, L5 (2012). 

. Wang, R. et al. http://arxiv.org/abs/1302.4154 (2013). 

Vieira, J. D. et al. Nature 495, 344 (2013). 


WHr 


ONDA 


© 2013 Macmillan Publishers Limited. All rights reserved 


ILLUSTRATION: NIK SPENCER; SOURCES: NASA/WMAP SCIENCE TEAM; R. ELLIS (CALTECH) 


ILLUSTRATION BY DALE MURRAY 


COMMEN 


oo 
a) 
Street lights that A life of Nikola Astronomer at i = hal Time for biologists to 
use less energy are better for Tesla — engineer, inventor Royal Observatory talks \. “ venture beyond the central 
people and animals p.560 and showman p.562 celestial images p.564 i Z dogma p.565 


The fourth age of research 


Jonathan Adams analyses papers from the past three decades and finds that 
the best science comes from international collaboration. 


esearch has progressed through three 
Re the individual, the institutional 
and the national. Nations competed 
to be at the cutting edge because this con- 
tributed to the wider economy through 
knowledge, new processes and products. 
Today, we are entering a fourth age of 
research, driven by international collabora- 
tions between elite research groups. This will 
challenge the ability of nations to conserve 
their scientific wealth either as intellectual 
property or as research talent’. Tensions are 


growing: between the knowledge a country 
needs to remain competitive and the assets 
it can exclusively secure, and between the 
collaborative and domestic parts of the 
research base. Institutions that do not form 
international collaborations risk progressive 
disenfranchisement, and countries that do 
not nurture their talent will lose out entirely. 

To explore the scale of this challenge, 
I analysed data on research articles and 
reviews from Thomson Reuters Web of 
Science between 1981 and 2012. Iincluded 
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papers in a country’s tally if one or more 
author addresses included that country, but 
no weighting was applied for the number of 
authors or addresses on each paper. For the 
25 million papers I included in my analy- 
sis, I calculated the balance of international 
and domestic research collaboration for 
established economies (the United States, 
the United Kingdom, Germany, France, 
the Netherlands, Switzerland) and emerg- 
ing ones (China, India, South Korea, Brazil, 
Poland). I looked particularly at all 
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STRENGTH IN NUMBERS 


Growth in international collaboration eclipses domestic output 
in established economies, but not in emerging ones. 


UNITED STATES 
The country is less internationally 
collaborative than those in Western Europe. 
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CHINA 
More than three-quarters of research 
output remains domestic. 
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> papers from the United Kingdom and 
the United States that listed multiple authors. 
Then I probed the relative citation impact of 
domestic and international research in the 
two countries. 

I found that, over more than three dec- 
ades, domestic output — papers that list 
only authors from the home country — has 
flatlined in the United States and in West- 
ern European countries. The rise in total 
annual output for each country is due to 
international collaboration. As a result, the 
percentage of papers that are entirely ‘home 
grown is falling. In emerging economies, by 
contrast, domestic output is rapidly expand- 
ing (see ‘Strength in numbers’). 

These shifts stand to change the dynamics 
of how nations invest in science, who owns 
its outputs and who best exploits them. 


BORDER CROSSING 

In Western Europe in 1981, domestic share of 
output was about five papers in every six pub- 
lished by each country, but fell to half of total 
output in 2011. The United States started as 
less collaborative — just 6% of US papers in 
1981 had a foreign co-author, but it too has 
had a marked fall in domestic share of output: 
it is the most frequent partner for many other 
nations. Nonetheless, it remains less interna- 
tionally collaborative than Western Europe. 

In emerging economies in 1981, the 
domestic share of output was similar to that of 
the established economies. Today, about 75% 
of the research output of China, Brazil, India 
and South Korea remains entirely domestic. 
The total volume of papers from these four 
countries has increased 20-fold — from fewer 
than 15,000 papers annually in 1981 to more 
than 300,000 papers now. Despite its Euro- 
pean location, Poland’s balance of domestic 
and collaborative papers is closer to this group 
(see Supplementary Information at go.nature. 
com/nszeck). Quantity is now being matched 
by significant improvements in quality: the 
citation impact of more than 10% of China’s 
domestic research is already above twice the 
world average. 

For established economies, total national 
research output has more than doubled over 
the past 30 years. Meanwhile, domestic out- 
put has increased by only about 50% for all 
except the Netherlands (which more than 
doubled). After the mid-1990s, the domes- 
tic research output of the United Kingdom 
(47,500 papers per year), Germany (45,000 
papers) and France (30,000 papers) levelled 
off while international collaboration in these 
countries increased more than ten-fold. 

What fuelled this growth in collabora- 
tion? Much of it has come from bilateral 
partnerships rather than multinational pro- 
grammes (see Supplementary Information). 
For instance, less than 1% of UK papers from 
2003 to 2012 list co-authors from all three of 
France, Germany and the United States, and 
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less than 0.1% of US output over the same 
period has co-authors from the United King- 
dom, Germany and China together. 

Switzerland is an interesting outlier. For 
asmall country, its research output has seen 
above-average growth and a high average cita- 
tion impact. Today, more than two-thirds of 
its publications have a foreign co-author. A 
large part of this comes from the country’s 
hosting of multinational programmes, such as 
CERN (Europe's particle-physics lab) and the 
World Health Organization, both in Geneva. 

Citation impact is typically greater when 
research groups collaborate, and the benefit 
strengthens when co-authorship is interna- 
tional*®. I calculated the citation impact of 
each UK and US paper published in 2001 
and then for 2011 by normalizing citation 
count to take into account the time elapsed 
since year of publication and the subject area 
of the journal. For each of these years, I then 
calculated an average normalized citation 
impact (where 1.0 is the world average) for 
papers that were purely domestic and for 
papers with a foreign co-author. 

For both countries, papers that are inter- 
nationally collaborative are cited relatively 
more often than purely domestic papers. 
And this ‘impact premium’ rose in both 
countries by around 20% between 2001 and 
2011 (see ‘Impact premium). 


ACASE IN POINT 

A closer look at the United Kingdom reveals 
the drivers of some of these changes. The 
UK government's Department for Business, 
Innovation & Skills (BIS) reports both rising 
volume and rising average citation impact 
for the UK research base relative to the world 
average. My analysis shows that this impact 
(up from 1.21 in 2001 to 1.47 by 2011) is 
mainly attributable to the near-doubling of 
its international collaboration (from 26,608 
to 50,423 papers with an average impact up 
from 1.48 to 1.72). By comparison, the UK 
domestic research base, which BIS has never 
analysed separately, has changed much less 
in quantity and quality. In 2001, the country 
published 46,671 domestic papers with an 
average impact of 1.05; in 2011 it published 
47,323 domestic papers with an average 
impact of 1.21. And in 2010, the number of 
UK papers with no overseas co-author actu- 
ally fell below the count of internationally 
collaborative papers. 

The benefits of collaboration are sys- 
tematic. I found that 130 UK universi- 
ties and colleges published more than 100 
research articles and reviews in journals 
indexed on Thomson Reuters Web of Sci- 
ence in 2002-11. Around half of these uni- 
versities have an international co-author 
on less than 40% of their papers. Average 
impact in this group rarely gets much above 
1.1. By contrast, research staff in institu- 
tions with greater than 50% international 


IMPACT PREMIUM 


In the United States (US) and the United Kingdom (Uk), papers with at least one author from 
another country are cited more often than purely domestic work. 


Relative average citation impact 


US 2001 


US 2011 


co-authorship on their papers had an 
average citation impact above 1.6 (see 
Supplementary Information). This category 
includes the five or six UK universities that 
are generally recognized as standing ahead 
of others, including the universities of 
Oxford and Cambridge. 

This same handful of elite institutions 
receives about one-third of all available 
core research income from the UK higher- 

education funding 


“Elite national councils, more than 
universities one-third of the total 
are also available research 
leading grant and contract 
international income and one-third 


ofall research-council 
studentships. By con- 
trast, the 65 or so UK universities that have 
an international co-author on less than 40% of 
their papers collectively account for less than 
10% of funding-council research money, less 
than 5% of research grants and contracts and 
less than 1% of research-council studentships 
(my analysis; using data from the UK Higher 
Education Statistics Agency). 


collaborators.” 


CONTEMPORARY GIANTS 
First-rate researchers are, and always were, 
thinly distributed. In the past they could col- 
laborate only intermittently. Contemporary 
giants call more readily across the desolate 
intervals of space (to paraphrase Friedrich 
Nietzsche) and rapidly receive answers, 
because information technology has sharply 
reduced the costs of collaboration’. 
Excellence seeks excellence, so elite 
national universities are also leading inter- 
national collaborators. Exceptional research 
groups share ideas, resources and outcomes. 
For example, the most frequent international 
partners of the University of Cambridge, UK, 
are the Max Planck institutes in Germany, the 
Massachusetts Institute of Technology and 
Harvard University, both in Cambridge; the 
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UK 2001 UK 2011 


California Institute of Technology in Pasa- 
dena; the University of California, Berkeley; 
and the universities of Toronto, Heidelberg 
and Tokyo. Harvard’s frequent international 
partners are Imperial College London, Uni- 
versity College London, the Max Planck insti- 
tutes, the Karolinska Institute in Stockholm 
and the universities of Cambridge, Toronto 
and Geneva. Internationally co-authored 
papers are more highly cited because the 
authors are more likely to be doing excellent 
research. Multinational programmes are still 
marginal as drivers of performance. 

So, in this fourth age of research there is 
a growing divide between international and 
domestic research. This will influence each 
nation’s ability to draw on the global knowl- 
edge base, and could in turn compromise 
national scientific wealth. For established 
economies, the risks are a gradual financial 
and intellectual separation between insti- 
tutions that are largely international and 
those that are largely national. Such separa- 
tion could lead to the erosion of adequate 
regional competency for future research 
training and collaboration and for knowl- 
edge flow to the national industrial base. 

Government policy should focus on three 
issues to address these risks. First, in this age 
of big data that are internationally shared, the 
question will be who has the skills to exploit 
knowledge assets fastest, not who owns them. 
Shared knowledge and discovery sideline 
the idea of securing intellectual property by 
commercial confidentiality and patenting. 
The growing scarcity of truly able research- 
ers could lead to a ‘global war for talent’®. So 
governments must provide and sustain con- 
ditions that attract and support the best sci- 
entists, otherwise talent flows elsewhere and 
there is no capacity to generate replacements. 

Second, incentives must be put in place 
to enable universities to participate in inter- 
national networks. For example, tangible 
projects involving participation couldbe > 
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> given explicit recognition and credit 
in systems for assessing research qual- 
ity (such as the UK Research Excellence 
Framework). Insight into the evolving 
research of the emerging economies will 
be limited without active engagement 
and collaboration. Relying simply on 
what is published in journal articles has 
long been recognized as a poor mode of 
knowledge transfer’. 

Third, collaboration must include 
European and US researchers in Asian 
and South American laboratories. The 
older economies can no longer rely on 
the best foreign researchers to come and 
visit. Maintaining a talent pool is neces- 
sary, but it is not sufficient: if a nation 
really wants to remain at the highest level 
scientifically, it needs to get its talent out 
and about — and then bring it back again. 
One way of doing this is to create worth- 
while travel and exchange programmes 
that include an assured post for the 
returning researcher. The United King- 
dom has a shocking record on mobility 
in European research programmes and 
the United States has the lowest level of 
international collaboration among the 
G7 countries. By contrast, the Nether- 
lands achieves excellence by enabling 
its researchers to be much more mobile, 
despite its lesser resources’®. 

Impact and innovation will flow from a 
coalition of the willing, not the straitjacket 
of international policy and coordination. 
Multinational programmes are not the 
answer, nor are academic memoranda of 
understanding. To maintain the dividend 
that governments garner from research 
excellence, they must ensure that uni- 
versities and their researchers have the 
resources, facilities and incentives to create 
and sustain flourishing partnerships. m 


Jonathan Adams was formerly director 
of research evaluation at Thomson 
Reuters and is now a research- 
management consultant in Leeds, UK. 
e-mail: jonathanzadams@gmail.com 
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Wasted light from street lights, such as those in New York City, contributes to skyglow. 


A green light 
for efficiency 


Efforts to improve street lights are providing a rare 
opportunity to cut both financial and environmental 
costs, argues Kevin Gaston. 


ver the past few years, local govern- 

ments in towns and cities across the 

world have been dimming street 
lights, turning them off for parts of the night, 
investing in more energy-efficient technolo- 
gies and gradually modernizing old lighting 
systems. Hard times and tight budgets have 
helped to drive this flurry of activity. 

The combination of financial pressures, a 
broad array of new lighting technologies and 
aricher understanding of the importance of 
natural-light cycles to the health of many 
organisms presents an unusual opportunity. 
To take advantage, environmental scientists 
must work quickly to gather and build on 
existing knowledge about the effects of 
artificial light on plants and animals, and 
must collaborate with those responsible for 
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the development and implementation of 
public lighting schemes. 

As well as disrupting the development, 
physiology and behaviour of many organ- 
isms, grid-based electric lighting is esti- 
mated to produce 1.5 billion tonnes of 
carbon dioxide each year globally. Moreo- 
ver, artificial light disturbs people’s natural 
circadian rhythms. Various studies have 
linked this disturbance to conditions such 
as breast cancer’ and obesity. A notorious 
impediment to astronomical observations, 
artificial light also robs us of moonlit land- 
scapes and properly dark or star-filled skies. 

Most existing street-lighting systems are 
inefficient and expensive. Much of the light 
they produce is wasted, contributing to sky- 
glow often tens and possibly hundreds of 
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kilometres away. Historically, some light- 
ing systems were installed not to aid people, 
but because they were thought to add to the 
beauty ofa city, or because they could be run 
at little extra cost by using excess electricity 
produced during periods of low demand. 
Likewise, the illumination of many roads 
has not been adjusted to account for the fact 
that headlights on modern vehicles are now 
some eight times brighter than those of cars 
and trucks built 60 years ago. 


DIM THE LIGHTS 

A global assessment’ of artificial lighting, 
carried out in 2006, estimated that street 
lighting consumes about 114 terawatt hours 
of electricity globally each year. For compari- 
son, this is almost 400 times the electricity 
that New York City consumes for street 
lighting annually. National and local gov- 
ernments in the United States, Europe and 
Asia have quickly recognized the potential 
for savings — not just by dimming and 
turning off street lights, but also by install- 
ing energy-efficient light-emitting diodes 
(LEDs) instead of conventional bulbs. As the 
cost of new technologies continues to fall, 
such efforts will increase. 

There are various ways to cut the 
energy used by street lights. These gener- 
ally involve reducing the time or extent to 
which lights are used, and their intensity**, 
Savings depend on the size and type of 


existing lighting systems, their purpose and 
whether systems are being newly installed 
or upgraded. If new roads, towns or cities 
are being built, for instance, investing in low- 
energy and centrally manageable lighting 
makes sense. The feasibility of retrofitting 
existing developments depends in part on 
how long it takes for the long-term savings in 
energy costs to cover the expense of replace- 
ment. For LED technology, this is currently 
about 10-20 years. 

Those responsible for modifying light- 
ing systems should consider other factors 
too. A nuanced approach would ensure that 
lighting systems do the least environmental 
harm, as well as providing the necessary 
benefits at minimal expense. 

Artificial light can influence myriad 
biological processes, including sleep, 
metabolism, germination and flowering. It 
can alter the distribution and abundance of 
organisms and disrupt predator-prey rela- 
tionships”. In fact, such effects are probably 
pervasive, given the importance of natural- 
light cycles on circadian clocks and organ- 
isms’ physiological responses to day length. 

In general, limiting the number of street 
lights, dimming them and turning them on 
for shorter periods reduces these negative 
biological effects®. But studies are needed to 
identify the best compromises. People are 
most in need of artificial light in the hours 
after dusk and before dawn — just when 
other species are most likely to perceive the 
day to be extended (if diurnal) or curtailed 
(if nocturnal). Adjusting these periods to 
better suit the needs of local ecology, within 
the constraints of human requirements, 
could reduce the negative impact of night- 
time lighting all round. 

White-light LED systems and other 
white-light technologies are proving popular 
because they improve colour rendering for 
humans — objects look more as they would 
when viewed in natural light. But increasing 
the range of wavelengths ofa light source also 
increases the overlap between the emitted 
light and the spectral sensitivities of a broad 
array of other organisms. Some white lights 
emit a lot of ultraviolet, making them more 
attractive to insects such as moths. These 
technologies should be limited to areas where 
they will be most useful and ecologically least 
destructive, such as in urban rather than rural 
settings. Parts of the light spectrum that are 
not useful to people should be avoided. 


EFFICIENCY AND ECOLOGY 

In Europe, major projects to improve the evi- 
dence base for ecologists’ recommendations 
for lighting schemes are under way. These 
include the ECOLIGHT project, an investi- 
gation of the ecological effects of light pollu- 
tion, which I lead at the University of Exeter, 
UK. The experimental components of this 
work need to be expanded to investigate the 
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environmental effects of night-time lighting 
in a sufficiently broad array of ecosystems 
and taxa. 

To ensure that everyone wins from the 
growing worldwide movement to reduce 
night-time lighting, a three-way dialogue 

is vital between those 


“Artificial investing in new 
° lighting technologies, 
Hie bed those producing them 
pe eee and environmental 
eee ek hole scientists. My group 
7 Hesig and I have found that 
germination discussions with local 
and : 7 authorities enable us 
flowering : to better envision and 


articulate how social 
challenges can be weighed against environ- 
mental ones. Equally, our findings offer such 
authorities more leverage to bring about 
changes. 

Through these discussions I have come to 
believe that the biggest barrier that local and 
national governments face in their efforts to 
reduce night-time lighting is the public per- 
ception of its importance in reducing vehicle 
accidents and crime. Unsurprisingly, almost 
all of the evidence connecting the importance 
of lighting to safety and security comes from 
experimental studies conducted in extremely 
low-risk areas, or correlational studies that 
are notoriously difficult to interpret”®. 

Determining the effects of lighting is also 
complicated by the fact that people alter their 
behaviour in response to lighting changes’. 
Brighter street lights, for example, can make 
people feel more confident and therefore 
induce them to drive faster. Thus, in addi- 
tion to studies on the ecological impact of 
different lighting regimes, research is needed 
to investigate their societal impact. 

This opportunity to install street light- 
ing that uses less energy and is better for 
human, animal and plant health must be 
grasped now. It may not arise again for many 
decades. = 


Kevin J. Gaston is director of the 
Environment and Sustainability Institute, 
University of Exeter, Penryn, UK. 

e-mail: k.j.gaston@exeter.ac.uk 
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Artificially generated lightning crackles around Nikola Tesla’s Colorado laboratory. 


The mind electric 


Patrick McCray assesses a biography of Nikola Tesla, 
the Serbian wizard of the alternating current. 


hen entrepreneur Elon Musk 
named his all-electric car com- 
pany Tesla Motors, he was pay- 


ing homage to a remarkable man. Serbian 
inventor and electrical engineer Nikola 
Tesla (1856-1943) created a veritable zoo of 
electrical inventions, from motors that used 
alternating current (AC) to radio-controlled 
boats, and a proposed system for the wireless 
transmission of electricity from one conti- 
nent to another. 

Bernard Carlson's superb biography fol- 
lows Tesla from his years in what is now Cro- 
atia, Austria and Hungary, where he studied 
physics, engineering and mathematics, to his 
1884 arrival in New York City and then on to 
spectacular successes and failures in electri- 
cal innovation. Carlson brings to life Tesla’s 
extravagant self-promotion, as well as his 
eccentricity and innate talents, revealing him 
as a celebrity-inventor of the ‘second indus- 
trial revolution to rival Thomas Alva Edison. 

Tesla worked briefly for Edison in the 
United States, but quit in disgust when Edi- 
son declined to use his arc-lighting system. 
Financially strapped, Tesla plunged into the 
technological ecosystem of New York, with its 
growing demand for electricity for the power 
and communications industries. As Carlson 
relates, these technological frontiers were 
largely unsettled at the time, with inventors, 
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entrepreneurs and financiers all fighting to 
secure a competitive advantage. A key ques- 
tion was whether Edison's direct-current 
approach or the AC option favoured by Tesla 
would come to dominate power transmission. 

Tesla’s signal accomplishment was the 
realization of his AC motor in the late 1880s. 
This was based on a series of inventions and 
patents for using two sources of alternating 
current out of phase with one another. Tesla 
saw that these created a rotating magnetic 
field that could make a motor. His polyphase 
power systems were backed by funds from US 
entrepreneur George Westinghouse and ulti- 
mately allowed utilities to transmit electrical 
power over longer distances than before. 

The first biography on Tesla appeared in 
1894, when he was riding high after build- 
ing an “oscillating transmitter” (a resonant 
transformer also called the Tesla coil) and 
seeing his polyphase AC motors deployed 
for power generation at Niagara Falls. Pub- 
lic lectures in the United States and Europe 
followed. Tesla had a consummate ability to 
impress audiences and 


potential backers with Tesla: Inventor of 


stunning electrical dis- peaeriiclieg Age 
plays. An 1891 lecture, ices 


for instance, featured 
two large zinc sheets 
suspended from the 
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auditorium ceiling and connected to a power 
source. Under dim lighting, Tesla stepped 
between the slabs holding a gas-filled tube in 
each hand. The electrostatic field made the 
tubes glow as Tesla told the enthralled crowd 
how electric lights might be moved around 
without being tethered to wires. 

Tesla further studied the possibility of 
wireless power transmission in 1899, while 
on sabbatical in Colorado Springs. (A well- 
known double-exposure photograph, pic- 
tured, in which he seems to sit nonchalantly 
amid fierce electrical discharges, was taken 
here.) The region’s summer storms led him 
to surmise that lightning bolts initiated elec- 
tromagnetic waves in Earth's crust, creating 
stationary waves. He believed that this pro- 
cess would allow power transmission “in 
unlimited amounts, to any terrestrial dis- 
tance and without loss”. 

Many writers have cast Tesla as a “Nietzs- 
chean superman’, as Carlson puts it. But 
Carlson looks critically at Tesla’s wilder 
claims — suchas his proposed particle-beam 
weapon, which never came to fruition despite 
stirring up interest among Soviet, British and 
US officials on the eve of the Second World 
War. Carlson is also frank about Tesla’s mis- 
understanding of scientific discoveries. In 
1887, for example, German physicist Heinrich 
Hertz detected the electromagnetic waves 
predicted by Scottish physicist James Clerk 
Maxwell. Tesla decided that the glow in evac- 
uated glass discharge tubes was due to “elec- 
trostatic thrusts”. The error did not prevent 
him from trying to convert Hertz’s basic dis- 
covery into devices in the lab. Technological 
innovation, not scientific discovery, remained 
this visionary engineer's primary goal. 

Carlson contextualizes Tesla’s approach 
with excursions into the nature and psychol- 
ogy of invention, exploring theories such as 
economist Joseph Schumpeter’s model of 
“creative destruction’ and business profes- 
sor Clayton Christensen’s more recent idea of 
“disruptive innovation”. Both these theories 
are based on the idea that entrepreneurs and 
inventors who produce radical technologies 
can cause widespread social and economic 
disruption. Tesla’s inventive style, says Carl- 
son, existed in “tension between ideal and 
illusion”: he first shaped inventions in his 
mind rather than taking Edison’s empirical 
approach. Tesla believed that his polyphase 
system rested on a beautiful principle, which 
he expected businessmen and custom- 
ers to adapt to. However, putting design 
ideals ahead of practical considerations — 
not unlike Apple’s late chief executive Steve 
Jobs, Carlson notes — sometimes meant that 
Tesla missed out on commercial possibilities. 

In about 1900, Tesla began to conceptualize 
a grand system that would enable the wire- 
less transmission of power and communica- 
tions “from Pike’s Peak” in Colorado's Rocky 
Mountains “to Paris” Buoyed by support from 
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Wall Street wizard John Pierpont Morgan, he 
oversaw the building of Wardenclyffe, an 
imposing laboratory-cum-transmitting tower 
on Long Island, New York. However, faulty 
research results undermined the plan and, 
unnerved by Tesla’s self-promotion and sus- 
picious of the growing speculative bubble in 
wireless communications, Morgan withdrew. 
The giant transmission facility was never 
completed and Tesla found himself scram- 
bling for cash. (Despite this, Tesla’s activi- 
ties probably spurred physicist and inventor 
Guglielmo Marconi to speed up his own work 
on wireless communications.) 

The failures that dogged Tesla’s transmis- 
sion scheme bit deeply into his perceptions 
of how the electrical world worked. Earth 
didn’t behave as if it were filled with an incom- 
pressible fluid, as Tesla believed. When the 
Wardenclyffe experiment failed, Carlson 
recounts, Tesla faced a “serious dilemma ... 
Either he was wrong or nature was wrong.” 
Ideas clashed with reality and Tesla, angry and 
depressed, had a nervous breakdown in 1905. 

Tesla’s last three decades in New York City 
were spent in relative obscurity. He never 
gave up the dream of wireless power trans- 
mission and continued to invent while living 
precariously offa modest stream of royalties. 
He gave annual press conferences in which 
he speculated about the future of technology. 
His life, one observer noted, took on a more 
“speculative, philosophical, and somewhat 
promotional character”. 

In the early 1970s, many years after his 
death, Tesla’s enigmatic behaviour (such as 
his passion for feeding pigeons) and linger- 
ing reputation for grandstanding electrifying 
illusions helped to cement his appeal among 
free-energy claimants — who believe there is 
electricity in our environment waiting to be 
picked up with the right technology — and 
counterculturalists seeking mystery in the 
rational and material world. Public interest 
has spiked again in the past few years. In 2012, 
an online campaign raised close to a million 
dollars in a week for a Tesla museum. A recent 
YouTube video has Tesla duelling with Edi- 
son ina rap battle, and Christopher Nolan's 
2006 film The Prestige features David Bowie 
as Tesla. Tesla’s bold predictions and outsider 
glamour are still working their magic. 

Tesla shows that a deep creative drive, 
guided by a formidable intuition, can serve 
inventors well up to a point. Innovation may 
be a subjective process, but Tesla’s career 
demonstrates that it can also be undermined 
by believing that illusion has substance. = 


W. Patrick McCray is a professor in the 
Department of History at the University 

of California, Santa Barbara. His latest 
book is The Visioneers: How a Group of 
Elite Scientists Pursued Space Colonies, 
Nanotechnologies, and a Limitless Future. 
e-mail: pmccray@history.ucsb.edu 


BOOKS & ARTS | COMMENT | 


Books in brief 


Denial: Self-Deception, False Beliefs, and the Origins of the 
Human Mind 

Ajit Varki and Danny Brower TWELVE 384 pp. $27 (2013) 

Do you skydive? Deep-fry? Chain-smoke? Denial of mortality is a 
strange trait that is also key to human nature, argues medic Ajit 
Varki. His argument stems from the ideas of late geneticist Danny 
Brower, who asked why species such as chimpanzees have not 
evolved to be aware of both self and the minds of others. Varki 
speculates that such intersubjectivity could only arise in tandem 
with ‘death blindness’, as fear would otherwise hamstring a species’ 
fitness. A thoughtful foray into “mind over reality”. 


PHA 
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The Mystery of the Hanging Garden of Babylon: An Elusive World 
Wonder Traced 

Stephanie Dalley OXFORD UNIVERSITY PRESS 304 pp. £25 (2013) 

The puzzling dearth of research on Babylon’s Hanging Garden, one 
of the seven wonders of the ancient world, prompted Assyriologist 
Stephanie Dalley to methodically sift the evidence. Her perusal of 
cuneiform tablets, rock reliefs and Latin texts yielded research gold, 
overturning long-held ideas about the creator and location of this 
vertiginous marvel. From its fantastical landscaping to its advanced 
irrigation system, the garden emerges as a wonder indeed — of 
engineering, aesthetics and metaphoric richness. 


The Longevity Seekers: Science, Business, and the Fountain of 
Youth 

Ted Anton UNIVERSITY OF CHICAGO PRESS 240 pp. $26 (2013) 

A “silver tsunami” is upon us, writes Ted Anton: by 2050, one-third 
of people in the developed world will be over 60. The time has come 
to tease out the “molecular tipping points” involved in maintaining 
geriatric health, Anton avers. Kicking off with molecular biologist 
Cynthia Kenyon — who in 1993 pinpointed a single-gene mutation 
that doubles the lifespan of the nematode Caenorhabditis elegans 
— Anton reveals a young field already rife with larger-than-life 
personalities and lab drama aplenty. 


Kith: The Riddle of the Childscape 

Jay Griffiths HAMISH HAMILTON 432 pp. £20 (2013) 

Sojourning on several continents to research her bestselling Wild, 
Jay Griffiths noted big differences between children from indigenous 
and Western cultures. She now grapples with that riddle, arguing 
that “human nature is nested in nature which co-creates the 
child”. Her probings of the meeting point of developing psyche and 
environment interweave history, anthropology and memoir. But 
does an urban existence enfeeble the young? What is abundantly 
clear, yet sidelined in this often brilliant, poetically nuanced work, is 
the ferocious adaptability of our species and our children. 


Billion-Dollar Fish: The Untold Story of Alaska Pollock 

Kevin M. Bailey UNIVERSITY OF CHICAGO PRESS 288 pp. $25 (2013) 

The last time you ate something labelled just ‘fish’, it might well have 
been pollock. The flesh of this Alaskan species turns up in fish fingers, 
sushi and seafood salad. Kevin Bailey, a former senior scientist at the 
Alaska Fisheries Science Center in Seattle, presents the first natural 
history of this ubiquitous fish and an analysis of its population. 
Although the market for pollock — worth more than a billion dollars 
a year in the United States alone — seems buoyant compared with 
some others, Bailey unveils a familiar tale of steep decline. Barbara Kiser 
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An image of the Butterfly Nebula captured by the Hubble Space Telescope. 


Q&A: Marek Kukula 
Star-map historian 


Marek Kukula is public astronomer at London’ Royal Observatory in Greenwich and the 
curator of Visions of the Universe, an exhibition charting the trajectory of celestial imaging, 
with a focus on astrophotography. On the eve of its opening, Kukula talks about eighteenth- 
century star maps and the co-evolution of the telescope and camera. 


How did you get 
into astronomy? 

I decided I wanted 
to be an astronomer 
during a family trip 
to Jodrell Bank 
in Cheshire, UK, 
w\, when I was in my 
teens. I remember 
looking up at the 
giant radio tel- 
escope and learning about how scientists 
were using it to try to answer some of the 
biggest questions in the Universe. It seemed 
like a huge intellectual adventure. 


What links the Royal Observatory to the 
exhibition’s venue, the National Maritime 
Museum? 

The observatory was built to make very 
accurate maps of the stars for use in navi- 
gation, which is why astronomy had gov- 
ernment funding in the seventeenth and 
eighteenth centuries. It was really only in 
the nineteenth century that people began 
to use observatories for modern astronomy 
and astrophysics. 
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What is the idea behind Visions of the 
Universe? 

The focus is astrophotography and its evolu- 
tion, told through roughly 100 iconic images 
of space, primarily from the past 150 years 
[though there are also some sketches from 
the pre-photography era]. The role of the tel- 
escope in astronomy is familiar, but a parallel 
story has been less well told: the impact of 
the invention and development of the camera 
on this field. We wanted to reveal how the 
telescope and the camera have changed the 
way we see the Universe, a story astronomers 
can be very proud of. Although the earliest 
cameras were not really suitable because they 
needed quite a lot of light, astronomers were 
among the first people to recognize their 
potential, and helped to develop the technol- 
ogy toa point at which it was actually useful. 


How did you choose the images? 

It has been a really agonizing process trying 
to narrow them down. We tried to pick ones 
that tell a good story, and included some 
because they just look so beautiful. Some are 
hugely important in scientific terms, if not in 
aesthetic ones. The first image beamed back 
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in 1959 by one of the 
Soviet lunar probes as it 
went behind the Moon 
is a really grainy televi- 
sion picture. When we 
dragged it up on the 
Internet, we burst out laughing because it is 
such a rubbish image. But it is hugely signifi- 
cant, both scientifically and culturally. It was 
the first time anyone had seen the other side 
of the Moon. 

Other images combine artistry with sci- 
ence. There is a really famous one from the 
Cassini space probe of Saturn eclipsing the 
Sun. There's a technology story here: this 
is of course a view that’s impossible from 
Earth. Because we can send a camera ona 
spacecraft out to Saturn, we're able to see this 
really stunning view. Between the inner and 
outer rings is a tiny speck, which is Earth. 
Were in the picture as well. 


Visions of the 
Universe 
National Maritime 
Museum, London. 
7 June to 15 
September 2013. 


Which are your favourite images? 

One is of the Andromeda Galaxy taken by 
Edwin Hubble on 19 October 1923. At the 
time, there was a great debate about whether 
the Milky Way was the entire Universe. 
Hubble picked out one star that is variable 
in brightness, so he wrote “VAR! on the 
photographic negative. The key thing about 
variable stars is that if you can measure their 
variability, you can calculate their distance. 
When he worked this out, he discovered that 
the Andromeda Nebula is 2.5 million light 
years away, well beyond the Milky Way. This 
image revealed the Universe as hundreds of 
times bigger than we had assumed. 


What’s the thinking behind the pictures 
from the surface of Mars? 

I find those images incredibly moving. On 
one level they look ordinary and boring: 
there’s a desert; there are some rocks. But 
they're on another planet — that’s incredible! 
For thousands of years, people saw Mars as a 
red dot in the sky. About 400 years ago, they 
started to see it as a tiny disc and began to see 
landscape features. We are the first genera- 
tion in history to see it as a landscape, with 
pebbles and sand dunes and hills, and to 
drive our robot avatar through it. 


What do you enjoy most about being public 
astronomer at the Royal Observatory? 

I get to work alongside historians of sci- 
ence. At university you tend to see science 
as a linear progression, but the observatory’s 
curators remind me that scientists in earlier 
centuries had no idea of the correct route to 
pursue. That gives you a much better under- 
standing of why we do things in sometimes 
quite perverse ways in modern science: this 
is the legacy of how we've arrived at our 
modern understanding. 
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NASA/ESA AND THE HUBBLE SM4 ERO TEAM 
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Account for the ‘dark 
matter’ of biology 


Philip Ball correctly opines of 
DNA that “we should lift some 

of the awesome responsibility 

for life's complexity from its 
shoulders” (Nature 496, 419-420; 
2013). In addition to the “-omes’ 
he lines up for this responsibility, 
other components that are vital to 
all life forms are the metabolome 
(small-molecule intermediates), 
the lipidome (including cellular 
membranes) and the glycome 
(massive arrays of sugar chains 
on cell surfaces and extracellular 
compartments). 

We now know that these other 
‘“-omes’ mediate much of the 
diversity and complexity found 
in natural biological systems. Ball 
makes an apt analogy to the 1998 
discovery that the expansion 
of the Universe is accelerating, 
which forced cosmologists 
to think beyond the standard 
model and realize that dark 
matter and dark energy actually 
dominate many key processes. 

Likewise, it is time for more 
biologists to venture beyond 
the standard model of biology 
(involving DNA, RNA and 
proteins) that has served us so 
well, and take into account the 
‘dark matter’ of the biological 
universe — as well as the 
physical, biological, social and 
cultural environment. 

Ajit Varki University of California, 
San Diego, La Jolla, USA. 
alvarki@ucsd.edu 


Getting the word out 
on biosphere crisis 


A landmark statement released 
last week, formulated froma 
review published in Nature last 
year, illustrates how effectively 
and rapidly bridges can be built 
between science and society. 
The review suggested that 
human influence may be forcing 
the global ecosystem towards a 
rapid, irreversible, planetary-scale 
shift (A. D. Barnosky et al. Nature 
486, 52-58; 2012). Californias 
governor, Jerry Brown, invited 


the authors to draw their 
findings to the attention of 
policy-makers, industry and the 
public. An international group 
of 16 scientists spent the next few 
months developing the statement 
and circulating it to the global- 
change scientific community. 

The resulting statement — 
‘Maintaining humanity’s life 
support systems in the 21st 
century’ (see go.nature.com/ 
prudoq) — is endorsed by 
more than 500 global-change 
researchers whose work spans 
every continent. It warns that 
unless decisive countermeasures 
are put into place immediately, 
climate change, loss of 
ecological diversity, extinctions, 
environmental contamination, 
human population growth and 
overconsumption of resources 
will degrade our quality of life 
within a few decades. 

The statement is already 
catalysing interactions between 
governments, universities, 
business leaders and scientists 
to educate local constituencies 
about global issues and to 
stimulate solutions. 

Elizabeth A. Hadly* Stanford 
University, California, USA. 
hadly@stanford.edu 

*On behalf of 4 co-signatories. See 
go.nature.com/sgrxwo for full list. 


Boost to translational 
medicine in Europe 


Next week will mark the launch 
of the European Infrastructure 
for Translational Medicine 
(EATRIS; see go.nature. 
com/3li8fs). EATRIS aims 
to help clients (industry, 
small enterprises, funding 
organizations and academic 
institutions) to translate their 
biomedical discoveries efficiently 
into preventive, diagnostic or 
therapeutic products up to the 
point of clinical proof of concept. 
More than 60 leading 
translational-research centres will 
make their facilities and expertise 
available to researchers and 
industry across Europe, through 
a single point of entry, bringing 


together basic scientists and 
clinicians, as well as public and 
private partners. These centres 
may be specialists in, for example, 
advanced therapies, imaging or 
biomarkers. 
EATRIS will minimize 
development costs and risks 
by matching expertise and 
optimizing the use of academic 
know-how, infrastructure 
and patient groups. Quality 
control, intellectual property 
and legal frameworks will all be 
standardized. Rapidly assembled 
multidisciplinary teams will 
address each project's specific 
regulatory and clinical needs. 
These measures should also 
attract much-needed attention to 
rare and orphan diseases. 
Giovanni Migliaccio, 
Frank H. de Man, Anton E. Ussi 
EATRIS Coordination and 
Support, Amsterdam, the 
Netherlands. 
giovannimigliaccio@eatris.eu 


Don’t oversimplify 
psychiatric disorders 


Psychiatric diagnosis is indeed 
imperfect, subjective and not 
based on pathophysiology 
or causation (Nature 496, 
416-418; 2013). But quests 
for biological markers and 
dimensional approaches, such 
as the Research Domain Criteria 
project mentioned, are unlikely 
to be clinically appropriate 
solutions because they run 
the risk of oversimplifying 
complex psychiatric illnesses and 
introducing predicative logic. 
For instance, people with 
post-traumatic stress disorder 
(PTSD) or psychosis may show 
similar abnormalities of frontal- 
lobe function on a magnetic 
resonance imaging scan, just as 
those with anxiety or psychosis 
may show similar overactivation 
of the amygdala in the brain. 
But the underlying reasons for 
these superficial similarities 
are different, and so are the 
treatments. For example, lithium 
is not an effective treatment for 
psychosis, anxiety or PTSD, but it 


is near-curative for some people 
who meet current diagnostic 
criteria for bipolar disorder. 

The genetics of some 
conditions may overlap (for 
example, schizophrenia, autism 
and bipolar disorder), as do the 
genetics of, say, multiple sclerosis 
and Crohn’s disease. However, 
these are distinct clinical 
conditions that require different 
interventions. 

Caution and healthy 
scepticism are essential before 
embracing fashionable trends to 
revise psychiatric diagnosis. 
Daniel R. Weinberger Johns 
Hopkins University School of 
Medicine, Baltimore, Maryland, 
USA. 
drweinberger@libd.org 


Communications 
need NIH funding 


The US Congress launched 
an enquiry into all ‘public 
relations expenditure by the US 
National Institutes of Health 
(NIH) following coverage in The 
Cancer Letter of spending by the 
National Cancer Institute (NCI) 
Office of Communications and 
Education (OCE; see Nature 
495, 142; 2013). In our view, the 
criticisms devalue OCE activities 
and risk diverting attention 
away from the budget cuts that 
reduced the NIH’s capacity to 
fund biomedical research. 
Critics should note that several 
OCE activities are mandated or 
requested by Congress, and that 
OCE expenditure for the last 
fiscal year is less than 1% of the 
NCT’s, so cutting this further will 
barely affect the NCI’s research- 
funding capacity. 
Communications are an 
important part of the NIH’s 
mission. The NIH can only 
benefit from making NCI services 
and discoveries more accessible 
to cancer patients, physicians, 
researchers and the public. 
Judith S. Bond, Bethany 
Drehman Federation of American 
Societies for Experimental Biology, 
Bethesda, Maryland, USA. 
bdrehman@faseb.org 
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COGNITIVE NEUROSCIENCE 


Time, space and memory 


The brain’s hippocampus contains place cells, which encode an animal’s specific location. The finding that hippocampal 
neurons may also respond to time could provide information on the coding of episodic memories. 


GYORGY BUZSAKI 


he philosopher Immanuel Kant made 

life difficult for neuroscientists by pos- 

ing the following dilemma: does the 
brain represent time and space, which are 
themselves actual entities, or does it produce 
time and space and impose these categories 
on the world with which we interact? A more 
mundane, yet important, practical issue is 
whether there is a dedicated time-keeping 
mechanism in the brain, similar to the clock 
of a computer. The hippocampal region of 
the brain has long been suspected to repre- 
sent Kantian space’. Now, in work published 
in Neuron, Kraus and colleagues” attempt to 
address the hard problem of time by describing 
‘time cells’ — guess where, in the hippocam- 
pus. The implications of these findings are 
potentially far reaching. 

The authors trained thirsty rats to run on 
a treadmill for tens of seconds for a reward of 
water, while recording the activity of groups 
of pyramidal cells in the animals’ hippocampi. 
The aim was to distinguish neurons that ‘track’ 
elapsed time from those that track the distance 
run on the treadmill. Distance is, of course, a 
simple product of time and running speed. But 
the researchers manipulated these three vari- 
ables by changing the speed of the treadmill 
from trial to trial, and requiring the rats to run 
for either a constant distance or a constant time 
on alternate days. 

Members of the hippocampal-cell popu- 
lation that Kraus et al. recorded were active 
transiently and sequentially, so that the entire 
duration of the run was evenly represented by 
neuronal activity within this population. Using 
exemplary statistical and computational- 
modelling methods, the authors then evalu- 
ated the contribution of elapsed time and the 
distance run to the activity patterns of each 
recorded neuron. 

In agreement with previous studies*“, most 
of the recorded cells responded to an insep- 
arable combination of time and distance. 
However, a minority of cells (still members 
of the broad distribution in the time-distance 
dimension) was mainly under the control of 
time spent on the treadmill, and the activity 
of an equally small fraction was distinctly cor- 
related with distance. Kraus and co-authors 
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Figure 1 | Externally driven and self-organized cell assemblies track time. a, Evolutionarily simple 
brains contain simple neural networks. Sensory input from the body and the environment activates 
input neurons, which interact with output neurons to generate appropriate reflex actions in a short time 
window. b, In more complex brains, multiple interacting loops of increasing length improve prediction 
of more elaborate events that occur at longer timescales. c, After extensive training, the loops can sustain 
self-organized, long-lasting neuronal sequences without reliance on external cues and can, therefore, 
support cognitive operations such as memory, planning and imagination. Progression of neuronal 
operations correlates with elapsed time (green arrows) irrespective of whether the operations are driven 


externally or internally. 


acknowledge that “it is impossible to com- 
pletely separate time and distance”. Nonethe- 
less, they interpret their results in favour of 
a dedicated mechanism in which this small 
population of ‘time cells’ specifically keeps 
track of time. This mechanism is distinct 
from that of path integration, by which dis- 
tances and directions the animal takes are 
integrated with the help of the evolving cell 
assemblies. 

The activity of time cells and path integra- 
tion during navigation are only part of the 
hippocampus’s story. This structure is also 
our resident ‘search engine, which allows us 
to navigate in ‘mental space’ when recalling 
memories or planning future actions’. Kraus 
et al. suggest that the time cells they have iden- 
tified are a key missing piece of episodic mem- 
ory, the long-term memory that enables us to 
recall specific events and experiences, because 
such memories are embedded in a spatio- 
temporal context. To scrutinize this inter- 
pretation, it is useful to consider the operations 
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of the brain in a wider context. 

Brains are predictive devices and exploit the 
fact that recurrence is a fundamental prop- 
erty of the world around us. Experience and 
memory allow the recall of similar situations 
and the deployment of previously effective 
actions. In simple neuronal circuits, such 
as those of invertebrates, signals from the 
environment or the body can trigger appro- 
priate (learned) responses within a relatively 
short time window (Fig. 1). With increas- 
ing organismal complexity, ever-increasing 
loops of neuronal networks are added to the 
basic circuit to improve prediction of more 
complex events and those with longer tem- 
poral separation between the input signals 
and the responses. After sufficient training, 
the long loops of larger brains can dispense 
with the reliance on external cues by inter- 
nally processing the probabilities of outside 
events and their most likely outcomes. This 
disengagement is a necessary condition for 
cognition’. 


Returning to the findings of Kraus and 
colleagues, these suggest that progression of 
neuronal information within cell assemblies in 
the hippocampus during spatial navigation can 
be controlled by environmental or body cues 
(for path integration) and by a time-tracking 
mechanism. Alternatively, episode-specific 
activity sequences of cell assemblies may roll 
forward as a result of self-organization in the 
absence of changing external cues, always pro- 
gressing along the path of the highest-proba- 
bility events’. This latter perspective suggests 
that the qualitative distinction between the 
causes of sequential neuronal activity is along 
the dimension of external dependence versus 
internal self-organization, rather than being 
related to time or distance. 

Brains, like clocks, do not produce time 
per se. So despite our intuition of separate time 
and space, it may be that the brain generates no 
such things. Instead, variations in the strength 
of synaptic communication between neurons 
may simply determine the direction of activ- 
ity flow across neurons under all conditions. 
The temporal flow of activity is a framework 
for recalling thousands of episodic memories 
or planning multiple possible consequences 
of actions. The evolving neuronal-assembly 
sequences that support these cognitive opera- 
tions may activate all hippocampal neurons at 
some point, including those that may some- 
times appear in the disguise of ‘time-tracking- 
only’ cells. 

Many brain regions can generate sequen- 
tial activity paced for their own needs*’” 
— ranging from the subsecond scale in 
the service of perception and motor con- 
trol to much longer timescales in memory, 
planning and imagination. These opera- 
tions progress along a time line but do not 
need appointed time cells. Admittedly, the 
questions Kraus and colleagues address are 
among the most complex in science, and the 
authors should be commended for tackling a 
hard problem at the intersection of philoso- 
phy and neuroscience. As is always the case 
with good science, their findings raise as 
many questions as they tried to answer. I have 
addressed only a few, and leave the rest to the 
Kant scholars. m 
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A spruce sequence 


The first published whole-genome draft sequence of a gymnosperm, the Norway 
spruce, provides a powerful platform for studying the unique development, 
adaptation and evolution of this major group of plants. SEE ARTICLE P.579 


RONALD SEDEROFF 


ithin the gymnosperm subgroup 
of seed plants are the iconic coni- 
fers, which dominate many forest 


ecosystems of the cold-temperate and sub- 
tropical regions of the Northern Hemisphere. 
Among the conifers are Earth’s oldest living 
individual plants, the bristlecone pines; its 
largest trees, the giant sequoias; and its tallest, 
the coast redwoods. Conifers also include the 
pine and spruce genera that supply much of the 
world’s wood for pulp, paper and solid-wood 
products. Genetic analysis is the key to under- 
standing the biology of these trees, but gym- 
nosperms typically have very large genomes, 
of up to 37 gigabases', and an abundance of 
repetitive DNA, making their sequences dif- 
ficult to assemble. So the sequence of the 
Norway spruce genome, reported by Nystedt 
et al.” on page 579 of this issue, represents a 
major technical, as well as an information-rich, 
achievement”. 

Gymnosperms, of which there are around 
1,026 species’, are vascular plants, meaning 
that they contain a tubular tissue network that 
transports water and nutrients, and provides 
mechanical support. The gymnosperms are 
one of two subgroups of seed-forming vascu- 
lar plants, the other being the angiosperms, 
the flowering plants. There are some 350,000 
species of angiosperm, some woody and others 
herbaceous, and this subgroup includes all our 
major food crops. 

The seeds of angiosperms are enclosed in 
an ovary, whereas those of gymnosperms are 
in an open state. The two subgroups also dif- 
fer in terms of their mechanisms of growth, 
development, metabolism, adaptation and 
evolution* — in factors including wood 
microanatomy, water transport, mechani- 
cal support, reproduction, development and 
ability to adapt to environmental change. The 
vascular system of conifers depends on long, 
thin cells with lateral pits called tracheids, a 
primitive system also found in early progym- 
nosperm fossils. Angiosperms typically have 


*This article and the paper under discussion? were 
published online on 22 May 2013. 
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more complex wood-cell anatomy, in which 
vessels with much larger diameters facilitate 
water transport and more-specialized fibre 
cells give mechanical support. The phenolic 
polymer lignin contributes to mechanical 
support and provides a hydrophobic surface 
for water transport in both angiosperms and 
gymnosperms, but the composition of the pol- 
ymer differs markedly between the two groups 
of plants. 

Identifying the genes that underlie these 
differences is of interest for both basic and 
applied research, but the long reproductive 


Figure 1 | Gymnosperm genes. Trees of the 
gymnosperm subgroup make up much of the 
forests of the Northern Hemisphere and provide 
a large fraction of the world’s wood. Nystedt et al.’ 
have presented the first draft whole-genome 
sequence of a gymnosperm, the Norway spruce 
(Picea abies). 
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cycles and large sizes of gymnosperms have 
made traditional, breeding-based analyses of 
these plants challenging. DNA-based tech- 
nology that can bypass these limitations has 
been particularly useful in forest trees, ena- 
bling genomic mapping, gene sequencing, 
genomic selection and genetic engineering. 
Whole-genome sequences are particularly 
powerful, because they provide a platform 
for a spectrum of new technologies, such 
as studies of an organism’s full transcrip- 
tome — its complement of transcribed RNA 
molecules. Nystedt and colleagues’ draft 
sequence is the first gymnosperm sequence 
to be published (Fig. 1), and a loblolly pine 
genome sequence (around 22 Gb) is expected 
to follow soon’. These conifer genomes are 
the largest plant genomes sequenced so far, 
much larger than the 17-kb genome of wheat® 
(an angiosperm). 

The spruce genome will not only acceler- 
ate the investigation of gymnosperm biology, 
it will also provide broader genetic and evo- 
lutionary insight. For example, researchers of 
the ENCODE project’ recently argued that 70% 
of the 3.2-Gb human genome is functional in 
some way. But large plant and animal genomes 
pose a challenge to this proposal. Consider, for 
instance, the spruce genome (around 20 Gb) 
and that of thale cress, the model plant Arabi- 
dopsis thaliana (0.135 Gb). If a similarly high 
proportion of these genomes were functional, 
what properties of gymnosperms could neces- 
sitate such a massively larger number of func- 
tional genetic elements? Repression of only two 
genes enables A. thaliana to extend its growth 
cycle and produce a substantial amount of 
wood’, suggesting that the large genomes of 
gymnosperms are not attributable to a peren- 
nial growth habit or wood formation. Early 
comparisons of transcribed genes indicated’ 
that the vast majority of genes in gymnosperms 
and angiosperms have homologues in the same 
gene families. Nystedt and colleagues esti- 
mate that the spruce genome contains around 
28,354 genes, which is very close to A. thaliana’s 
27,407. The angiosperms maize (corn), 
rice and poplar have estimated gene num- 
bers of around 40,000. But these coding 
regions comprise a fraction of the sequence 
of these large genomes, and what the func- 
tions of the remaining sequences may be is 
still obscure. 

Another curious aspect of gymnosperm 
genomes is the evolutionary conservation, in 
many species, of a haploid (single copy) chro- 
mosome number of 12, despite their genome 
sizes ranging from 9.7 to 37 Gb’. The genomes 
will also provide insight into mechanisms of 
ancient and recent evolutionary adaptation 
in plants. Gymnosperms are thought to have 
originated from progymnosperms 360 mil- 
lion years ago, but much about the origin of 
angiosperms remains a mystery. Although 
the fossil record supports a gymnosperm 
origin for angiosperms, estimates of the time 
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of angiosperm origins vary by more than 
100 million years. There is also controversy 
regarding relationships within the gymno- 
sperms, particularly the relationship of the 
conifers to the gnetophytes, which include 
the bizarre desert plant Welwitschia. 

The genome sequences of these trees will not 
only help us to understand the past, but may 
also increase our understanding of present- 
day northern-latitude forests. Gymnosperms 
became the dominant forest plants in the late 
Palaeozoic and the Mesozoic periods, around 
300 million to 70 million years ago. But during 
the most recent ice age, much of the northern 
latitudes were covered by ice. When the glaciers 
last retreated, only about 10,000 years 
ago, the conifers were the major pioneer spe- 
cies that dominated that land. Understand- 
ing how the gymnosperms established new 
ecosystems as the glaciers shrank is becom- 
ing more important as we anticipate the 


PLANETARY SCIENCE 


effects of global climate change on the world’s 
forests. m 
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Evolutionary dichotomy 
for rocky planets 


A simple model shows that a rocky planet close to its star may solidify so slowly 
that its water is lost to space and the planet becomes desiccated, whereas a planet 
farther out may solidify quickly and retain its water. SEE LETTER P.607 


LINDA T. ELKINS-TANTON 


B=: and Venus were probably built 


from similar rocky materials, having 
been formed by similar mass-accretion 
processes. The giant impacts that character- 
ize these processes are thought to melt the 


Slow cooling 
dries planet 


growing planets to some depth, producing one 
or more magma-ocean stages during which the 
silicate portion of the planet is melted before 
solidifying. Thus, there has been no reason to 
suspect that Venus and Earth differed through 
the first tens and probably hundreds of mil- 
lions of years of the Solar System, and Venus 


Critical 
distance 


Rapid cooling 
traps water 


Figure 1 | Hamano and colleagues’ evolutionary model for terrestrial planets’. Rocky planets that 
form and solidify close to their host stars may receive enough heat from the star to slow cooling, allowing 
time for water to escape into space and causing the planet to dry out. Rocky planets that form farther 
from their stars, beyond a critical distance, would cool quickly, trapping water in silicate minerals in their 


interiors and as liquid on their surfaces. 
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is commonly thought to have lost its water 
through some later divergence from Earth-like 
evolution. On page 607 of this issue, Hamano 
et al.' present a simple model that might 
explain why rocky planets that have similar 
compositions but orbit at different distances 
from their host stars can end their magma- 
ocean stages with either an Earth-like wetness 
or a Venus-like dryness. This model does not 
require any later divergence to explain the dif- 
ferences between the planets. 

Almost 30 years ago, researchers showed 
how a dense steam atmosphere can be gener- 
ated on a young, hot planet by the solidifica- 
tion of an impact-generated magma ocean’. 
After the upper troposphere (the lowest por- 
tion of the atmosphere) of the young planet 
has become saturated with steam, that atmos- 
pheric layer imposes a strict upper limit on 
outgoing radiation from the magma ocean — 
about 300 watts per square metre. Therefore, as 
soon as the magma ocean produces a steam- 
saturated troposphere, the cooling rate of the 
planet is controlled by this one simple limit. 

Previously, several groups had calculated 
that a magma ocean should solidify in just mil- 
lions of years**. These calculations assumed 
that the planet had lower incoming heat flux 
from the star than outgoing heat flux from the 
magma ocean. The crucial feature of Hamano 
and colleagues’ model is that some planets 
are close enough to their star for the incom- 
ing heat flux to be higher than the 300 Wm” 
outgoing radiation limit, and thus the planet 
would be prevented from cooling at all until 
water was lost from the steam-saturated 
atmosphere. 

For planets close to their star, solidification 
and cooling together may take orders of mag- 
nitude longer than for more distant planets, 
perhaps as long as hundreds of millions of 
years. After solidification, cooling proceeds 
only as water is stripped from the hot, inflated 
atmosphere by hydrodynamic escape. The 
longer that solidification and cooling take, the 
more water is lost to space, and the drier the 
planet becomes. Thus, the distance ofa terres- 
trial planet from its host star might produce an 
evolutionary dichotomy (Fig. 1). 

The authors further suggest that Earth 
solidified far enough from the Sun to have a 
net loss of planetary heat from the beginning, 
allowing it to solidify quickly. Earth’s initial 
water inventory influenced the volume of 
only its initial oceans. Venus, however, may 
have had net heat flux into the planet, and its 
current dryness might be related to this early 
slow solidification and attendant atmospheric 
water loss, before cooling allowed the water in 
the steam atmosphere to cool and condense 
into liquid oceans. 

Recent work on geochemical tracers has 
indicated that Earth’s mantle solidified and 
differentiated from a magma ocean more 
than 4.45 billion years ago®, probably around 
4.52 billion years ago’, which agrees with rapid 
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solidification. For Venus, however, there are 
insufficient geochemical data to perform this 
test. Measurements of deuterium and hydro- 
gen in the Venusian atmosphere indicate 
that the planet has lost a substantial amount 
of water over time®’, but whether that loss 
occurred at the time of solidification or more 
recently is a matter of argument. 

The authors’ model underscores the impor- 
tance of the earliest accretion and solidifica- 
tion steps in determining the future evolution 
of the rocky planets. However, several crucial 
caveats need to be considered in applying this 
model. First, in extrapolating back in time, the 
faint young star’s radiation level needs to be 
considered. Second, initial atmospheres might 
not all be water-rich; the rocky building blocks 
for some planets might have produced atmos- 
pheres rich in methane and hydrogen, instead 
of steam”*. In the absence of a steam atmos- 
phere, there would be no outgoing radiation 
limit to slow solidification and cooling. Third, 
forming an initial atmosphere above a magma 
ocean is not a simple process. The removal of 
volatile gases from magma might require a sig- 
nificant degree of supersaturation and might 
not occur until late in solidification. If this is 
so, then solidification would proceed to a high 
degree before a steam atmosphere formed and 
occluded heat flux. 

Although proximity toa star affects planetary 
water content, this is not the only parameter 
that dictates the habitability of a rocky planet 
— the planet’s composition also has a strong 
influence on all aspects of habitability, such as 
bulk atmospheric composition, susceptibility 
to plate tectonics and formation ofa shield- 
ing magnetic field. A challenge for the com- 
ing decades will be to make measurements of 
exoplanets that allow the testing of models for 
habitability, and these tests need to include 
composition. How do atmospheric species 
other than water affect the solidification rates 
of magma oceans? What atmospheric compo- 
sitions would be expected in the wake ofa slow 
solidification with substantial water loss? The 
habitability of Earth and the inhospitability 
of Venus may be the inevitable result of our 
planetary sibling order next to the Sun rather 
than later evolutionary bifurcations. If so, simi- 
lar patterns of habitability are likely to be found 
in exoplanets. = 
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It’s an analog world 


The first synthetic genetic circuits to use analog computation have been developed. 
These circuits involve fewer components and resources, and can execute more 
complex operations, than their digital counterparts. SEE LETTER P.619 


HERBERT M. SAURO & KYUNG HYUK KIM 


he engineering of biological networks, 

a discipline called synthetic biology, 

has seen remarkable progress since 
its inception in 2000. The fact that we can 
now design simple but functional circuits 
in vivo is probably one of the most impor- 
tant aspects of this progress — it suggests that 
our biophysical understanding of the cellular 
milieu is largely correct, even if many details 
remain to be resolved. The significance of 
this should not be underestimated, because it 
means that predictive engineering of new cel- 
lular networks and systems is possible. Until 
now, approaches to synthetic biology have 
been greatly influenced by the digital com- 
puting technology that we use every day. But 
biological cells do not process information 
in a solely digital fashion; rather, they carry 
out many operations in an analog manner. 
In this issue, Daniel et al.' (page 619) break 
the mould and venture into a completely 
new area of synthetic biology, presenting 
synthetic circuitry that can perform analog 
calculations*. 

Synthetic biology has already led to several 
notable firsts — from the construction of 
switches, oscillators and feed-forward networks 
to the creation of gene regulatory circuits that 
can carry out Boolean logic. In many ways, 
digital computing has coloured our approach 
to synthetic biology. It is a truly remarkable 
achievement that so much can be done with 
just the ‘ones’ and “zeros’ that form the basis of 
digital processing. After more than 60 years of 
digital computing, it is no surprise that many 
modes of thinking have become highly influ- 
enced by the digital paradigm. It is often said 
that a biological cell is just like a digital com- 
puter, and that reprogramming a cell is like 
writing software. But the digital analogy is a 
misleading one, because many processes that 
occur in a biological cell have no counterpart 
in a digital computer. If reprogramming a 
cell was just a case of writing code, then re-engi- 
neering cells would be simple — but it is not. 


*This article and the paper under discussion’ were 
published online on 15 May 2013. 


572 | NATURE | VOL 497 | 30 MAY 2013 


The reality is that cells use a hybrid approach 
to information processing. In some cases they 
use digital yes-or-no decisions, but in many 
cases cellular signals are analog, with levels 
of gradation. More exotic, little understood 
signal-processing techniques involving noise 
and other forms of signal probably also con- 
tribute. And on top of that, a complex chem- 
istry exists that continuously reassembles the 
cell in real time. 

Nevertheless, constructing digital devices is 
both a useful and an interesting engineering 
challenge for synthetic biologists, and many 
advances have been made in mimicking digital 

systems either in vivo 


“The ability to or in vitro using DNA 
process grad ed as components. One 
° . remarkable study 
ipl ec was the construction 

y theti of an in vitro 4-bit 
Ha (i ee ad square-root calcu- 

ra ogica z lator consisting of 
circuits willbeof 139 DNA strands”. 
interest tomany Another was the con- 


»” 
researchers. struction ofan in vivo 


system to detect four 
different inputs, comprised of four sensors and 
three logic gates’. 

Although these are significant achieve- 
ments, such circuits require large numbers 
of components to perform even the simplest 
computation. A 4-bit binary adder, for exam- 
ple, might require 30 or more proteins to 
operate, and at the same time would place a 
substantial metabolic burden on a cell. More 
challenging and perhaps more interesting is to 
attempt to mimic analog computation in a cell, 
as Daniel et al. have done. 

One key advantage of analog over digital is 
that far fewer devices are needed to carry out 
a given computation at the moderate preci- 
sion needed in cells’, and fewer devices mean 
lower resource requirements. In addition, the 
richness of signal processing that can be car- 
ried out in analog systems is far greater than 
what can be accomplished in digital systems 
using the same number of components. But 
analog information processing brings with it 
anew set of challenges. Compared with that of 
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digital systems, the design of analog circuitry 
requires greater expertise in feedback-system 
design, circuit theory and signal processing. 
Daniel and colleagues’ work is founded on the 
close analogy they saw between the exponen- 
tial thermodynamic electron flow that occurs 
in transistors and the exponential thermo- 
dynamic rates seen in chemical reactions’. This 
similarity led them to realize that analog elec- 
tronic circuits that operate in the logarithmic 
domain might be effective mimics of analog 
biological circuits that operate in the logarith- 
mic domain. 

To test this idea, Daniel et al. constructed a 
gene-transcription unit in which the concen- 
tration of a transcription factor (a protein that 
regulates gene transcription) could be loga- 
rithmically transformed over a wide range of 
concentrations, thus enabling fine control of 
gene expression. 

Moving an input signal into the logarith- 
mic domain has many benefits, especially that 
basic calculations such as division and multi- 
plication can be more easily executed in log 
space. The core of the authors’ logarithmic 
unit is a positive-feedback loop operating 
in an analog mode. A shunt is included that 
removes excess transcription factor so as to 
increase the range of operation and efficiently 
implement positive feedback. With this basic 
logarithmic unit in place, the researchers could 
then focus on adding higher-order functions. 
For example, by attaching a repressor module, 
they created a circuit whose readout was the 
negative of the log of the input. Another cir- 
cuit simply summed the activity of two parallel 
logarithmic circuits to mimic a multiplication 
operation. 

Furthermore, using just two transcription 
factors, the authors were able to create a neg- 
ative-feedback analog circuit that scaled the 
logarithmic function. Especially interesting 
was their construction of a circuit that com- 
puted the ratio of two inputs over four orders 
of magnitude of input concentration. Comput- 
ing a ratio might have many applications, such 
as normalizing or comparing values, or even 
providing an in vivo pH meter. 

The ability to process graded information by 
means of synthetic biological circuits will be of 
interest to many researchers. For example, most 
inputs from the environment are graded, and 
a synthetic circuit with a fluorescent protein 
readout might be able to represent the rate of 
change of an environmental input rather than 
its absolute level. This could be accomplished 
by having an analog differentiator circuit. Other 
analog designs might be used to measure the 
weighted sum of environmental inputs, such 
that when a particular combination of inputs 
is reached it triggers a change in the state of the 
cell. Designing such analog circuitry may also 
further our understanding of natural systems. 
The function of many regulatory systems in 
cells remains obscure, and studies of synthetic 
biology in the analog domain should lead to 


new theories on how biological systems process 
information, and thus allow such systems to be 
more finely controlled. = 
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Hesitation on 
hominin history 


Extensive studies of fossil skeletons of Australopithecus sediba provide 
fascinating details of the anatomy of this hominin species, but do not 
convincingly indicate its position on the evolutionary route to modern humans. 


WILLIAM H. KIMBEL 


he evolutionary events that led to 

the origin of the Homo lineage are an 

enduring puzzle in palaeoanthropology, 
chiefly because the fossil record from between 
3 million and 2 million years ago is frustrat- 
ingly sparse, especially in eastern Africa. 
Much attention has been paid to two fossilized 
skeletons, found in approximately 2-million- 
year-old sediments at the Malapa cave site in 
South Africa, that are recognized as represent- 
ing the species Australopithecus sediba. 
These have been the focus of scrutiny 
because of both their excellent preserva- 
tion and claims'” that this hominin — a 
species more closely related to humans 
than to chimpanzees — lies at the base 
of the Homo lineage. A series of reports 
published in Science** sheds light on 
the morphology of A. sediba but, in my 
view, does little to elucidate its role in 
later human evolution. 

Dental morphology is a frequent 
source of information about hominin 
phylogeny but, in the first of these new 
papers, Irish et al.* take the unconven- 
tional step of using only the Arizona 
State University Dental Anthropology 
System — a graded series of minor 
crown variants originally devised to 
distinguish recent human populations 
from one another — to decipher rela- 
tionships between hominin species that 
are millions of years old. I have serious 
doubts about the phylogenetic mean- 
ing of morphological similarity in this 
case. These concerns are compounded 
by the authors’ reliance on the gorilla 
as the sole outgroup in their cladistic 
analysis. Their results link A. sediba 
exclusively to Australopithecus africanus, 
an older (approximately 2.7 million to 


Figure 1 | Australopithecus sediba. A series of papers” 
presents extensive studies of these two fossil skeletons, which 
date to approximately 2 million years ago. The authors compare 
the anatomy of this hominin to that of other species of the 
Australopithecus and Homo genera. 


2.3 million years old), potentially ancestral, 
southern African species with which it also 
shares some key cranial features’. If this find- 
ing is borne out by further work, then the rele- 
vance of A. sediba to the origin of Homo would 
be inextricably tied to that of A. africanus, 
whose own position in hominin phylogeny is 
by no means settled’. 

De Ruiter and colleagues’ analysis of the 
A. sediba mandible’ includes a measurement- 
based comparison in which the sub-adult 
individual MH1 (with only its second molar 
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erupted) is treated as though its growth had 
been completed. However, for most dimen- 
sions, hominoid mandibles achieve only 
around 75-90% of their adult values by the 
time of the second molar eruption”’. So, 
although the A. sediba mandibles seem to be 
small and lightly built (and thus Homo-like) by 
australopith standards, it is unclear how much 
of this impression is due to the authors’ use ofa 
sample comprising a sub-adult and a presumed 
adult female (MH2). 

Much of the value of the Malapa material 
lies in the extremely rare association of upper 
and lower limb parts with elements of the axial 
skeleton in two individuals of the same species 
(Fig. 1). These skeletons paint a portrait of a 
pectoral girdle that retains more ape-like anat- 
omy than the pelvic girdle". Churchill et al.” 
report that a fairly complete scapula (from 
MH2) features an upwardly tilted articulation 
for the humerus and a relatively broad attach- 
ment area for a muscle that helps to lift the arm 
over the head, a familiar australopith upper- 
limb pattern that also includes long, strong 
forearms and curved fingers. Although these 
features are embedded in a terrestrial bipedal 
frame, they are often interpreted as signs 
of retained ancestral arboreal climbing 
behaviour®”. Still unsettled is what 
led to the refashioning of the hominin 
shoulder by the time, around 1.6 million 
years ago, of Homo erectus, a species that 
shows modern upper-limb and shoulder 
morphology (this anatomy is unknown 
in the approximately contemporaneous 
Homo habilis). Simply leaving the trees 
seems to be an insufficient explanation. 

Schmid et al.° used the low curvature 
of the upper ribs of A. sediba to argue for 
a conical ribcage and elevated shoulders 
similar to those of the great apes, even 
though second- and fourth-rib curva- 
tures do not actually distinguish apes 
from humans. However, it is clear that 
the unusually strongly curved first rib 
articulates only with the first thoracic 
vertebra, as in humans and Australo- 
pithecus afarensis. This configuration 
is at odds with a completely ape-like 
upper thorax and has been associated 
with descent of the shoulder after the 
upper limbs were freed from locomo- 
tion, although this interpretation has 
been contested". A further puzzle is the 
A. afarensis partial skeleton KSD-VP 
1/1, which, although 1.6 million years 
older than the A. sediba skeletons, has 
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an upper thorax more similar to that of mod- 
ern humans”. A lower (ninth) rib from MH2, 
which is more like those of modern humans 
in its curvature and torsion, is consistent with 
the less flaring pelvic rim of A. sediba when 
compared with A. afarensis and A. africanus”. 

The ‘long-backed’ lumbosacral vertebral 
formula of six lumbar and four sacral ver- 
tebrae that is seen in species from australo- 
piths through to Homo erectus is the probable 
primitive condition for humans and the great 
apes’°. The evolution of the most common 
modern-human condition of five lumbar and 
five sacral vertebrae occurred by ‘sacralization’ 
of the lowermost lumbar vertebra. Williams 
et al.’ argue that MH2 is unusual for an early 
hominin in having this derived pattern, but the 
MH2 formula depends on how one defines a 
lumbar vertebra — by a lack of rib articula- 
tions or by functional criteria that relate to 
intervertebral movement. In early hominins, 
the first of the six functional lumbar verte- 
brae carries rib articulations that are similar 
to those of a thoracic vertebra. MH2, in fact, 
resembles other Australopithecus specimens in 
having six functional lumbars, but this is unex- 
pected in an early hominin with five sacral 
vertebrae. 

The last of the papers presents DeSilva and 
colleagues’ reconstruction of the A. sediba 
gait® (based on the MH2 skeleton), which 
will be controversial. The proposed ‘hyper- 
pronation of the foot and extreme inward rota- 
tion of the leg and thigh suggest an ungainly 
bipedal stride that might have made it into 
Monty Python's ‘Ministry of Silly Walks’ 
sketch. The presumed inversion of the foot 
at the heel-strike of the surprisingly ape-like 
calcaneus, combined with a vertical shank 
(tibia), outwardly angled thigh (femur) anda 
long, lordotic lower back — all hallmarks of 
terrestrial bipedality in Australopithecus and 
Homo species — constrains the reconstruc- 
tion. Prominent osteophytic growths on the 
pelvis and fibula at the attachment sites of the 
thigh musculature raise the possibility of a gait 
that was pathologically impaired, but DeSilva 
and colleagues argue that this locomotor pat- 
tern was adaptive. However, if A. sediba was a 
descendant of A. africanus, which, similarly to 
the even older A. afarensis (dating to between 
3.7 million and 3.0 million years ago), shows 
no trace of this pattern, then it is hard to imag- 
ine the selective advantage that would accrue 
from such a kinematically peculiar gait. 

Given the mix of features seen in A. sediba, 
it is difficult to understand why these research- 
ers insist that it lies at the base of the Homo 
lineage’. Similar intellectual gymnastics are 
required to comprehend the authors’ argu- 
ment that no African Homo fossils exist from 
before the time of A. sediba®"’. Although the 
recent papers constitute a fascinating further 
analysis of the A. sediba fossils, I do not think 
that they provide compelling evidence that 
this species is anything other than an unusual 


574 | NATURE | VOL 497 | 30 MAY 2013 


australopith from a Pliocene-—Pleistocene 
time period that is already populated by a fair 
number of them”. 
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A glimpse inside 


a mMagnetar 


Hundreds of neutron stars have exhibited ‘glitches’ in their spin-down rates — an 
indication of ultra-dense superfluids in their interiors. Now one highly magnetized 
star has shown a surprising glitch in the ‘wrong’ direction. SEE LETTER P.591 


ROBERT C. DUNCAN 


neutron star resembles a giant atomic 

nucleus, with 1—2 times the Sun’s mass 

packed into a ball about 20 kilometres 
across. Its gravity is so strong that a projec- 
tile would need to be launched at about half 
the speed of light to escape from its surface. 
Extreme density, pressure, temperature, mag- 
netism and relativistic gravity make these 
objects fascinating but challenging to study. 
Surprising observations of spin-down irregu- 
larities in one intensely magnetized neutron 
star, reported by Archibald et al.' on page 591 
of this issue, offer clues about exotic processes 
occurring deep inside these objects. 

The basic structure of a neutron star is 
generally agreed on. It has a crust about 1 km 
thick, in which nuclei are arranged in a crystal 
lattice immersed in a ‘sea of electrons. Near 
the surface, the nuclei are plain iron, but the 
pressure and density increase rapidly with 
depth, so that the nuclei become increas- 
ingly bloated and neutron-rich. At moder- 
ate depth, neutrons ‘drip’ out of the nuclei, 
forming a neutral liquid between the lattice 
nuclei. At the base of the crust, the bloated 
nuclei merge. Below this lies pure nuclear 
fluid, more than 200 trillion times denser than 
liquid water. 
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Unlike atomic nuclei, which contain almost 
equal numbers of protons and neutrons, the 
nuclear fluid inside a neutron star has roughly 
20 neutrons for every proton, a ratio main- 
tained by neutrino-emitting processes. There 
is also one electron or muon (‘heavy electror’) 
per proton, ensuring charge neutrality. These 
particles are forced together by tremendous 
pressure, but quantum mechanics requires 
them to occupy different states, so they fill 
all available energy states up to a high energy, 
the Fermi energy. Deep below the crust, as the 
pressure rises, neutron and proton Fermi ener- 
gies get so high that exotic, strongly interacting 
particles such as hyperons and mesons might 
join the mix. The actual fluid composition at 
very high densities is uncertain. It is possible 
that the innermost, central core consists of a 
‘soup of quarks, the elementary particles that 
make up protons and neutrons. 

Fortunately, observations of neutron stars 
yield insight into their interiors. The most 
thoroughly studied neutron stars are radio 
pulsars, which emit radio blips as they rotate. 
Timing the blips reveals that these stars stead- 
ily spin down. This is due to their intrinsic 
magnetism: as radio pulsars spin, they blow 
out magnetic waves and winds of fast charged 
particles, which carry away angular momen- 
tum. Interestingly, this steady spin-down is 


occasionally punctuated by ‘glitches’ — inci- 
dents in which the spin rate abruptly jumps 
up bya small fractional amount. Hundreds of 
radio pulsars have been accurately timed, and 
many hundreds of glitches observed”. The 
glitches are attributed to imperfect coupling 
of superfluids within the stars: as a neutron 
star spins down, superfluid components tend 
to spin faster than the rest of the star. A glitch 
occurs when the superfluid occasionally shifts 
closer to co-rotation. 

Although young neutron stars are hot, even 
by astrophysical standards, with interior tem- 
peratures commonly on the order of 10° kelvin, 
their strongly interacting neutrons can pair 
up and rearrange into quantum superfluids: 
phenomena found in low-temperature phys- 
ics laboratories’. In the inner crust, below the 
level of ‘neutron drip, the pairing neutrons 
have zero relative angular momentum, analo- 
gous to Cooper electron pairs in a supercon- 
ductor. In the much denser fluid beneath the 
crust, the short-range repulsive part of the 
neutron-neutron interaction does not favour 
this; instead, pairs have one quantum unit of 
orbital angular momentum, as in laboratory 
superfluid helium-3. Protons beneath the 
crust also pair into a liquid superconducting 
state if the local magnetic field is not prohibi- 
tively intense. Deeper in the star, any mesons 
present will probably form Bose condensates, 
another kind of superfluid. Even quark soup 
will pair up into an exotic state known as 
colour superconductivity” . Thus, at almost 
every depth within a neutron star, inter- 
penetrating superfluids exist that flow with- 
out viscous drag, and that could conceivably 
participate in glitches. 

The most popular model for radio-pulsar 
glitches holds the inner-crust neutron super- 
fluid responsible*®. One property of a rotat- 
ing superfluid is that all of its vorticity — all 
circulation tendency in the flow — is concen- 
trated in quantum vortex lines: multitudes 
of microscopic nodes, or holes, in the super- 
fluid that thread through the fluid parallel to 
the rotation axis. As the superfluid in a star 
spins down, these quantum vortices migrate 
outward towards the equator and ultimately 
annihilate near the stellar surface. But vorti- 
ces in the deep crust can become ‘pinned, or 
stuck, to the crust lattice nuclei. This keeps the 
inner-crust superfluid rotating faster than the 
rest of the star, until the vortices come loose in 
a catastrophic unpinning event, observed as a 
pulsar glitch. 

So far, so good. But in their study, Archibald 
and colleagues report an ‘anti-glitch. Instead 
of an abrupt spin-up, the star abruptly spun 
down. (Such phenomena have been seen 
before, but only at a much smaller level’.) 
Despite searches with sensitive radio and 
X-ray telescopes, no surrounding afterglow 
was detected, arguing against a sudden particle 
outflow that could have carried off the anti- 
glitch’s angular momentum. The most likely 
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Idealized model for a magnetar 


anti-glitch 


The star and its interior superfluid begin 
in co-rotation. The number of superfluid 
vortices penetrating the inner crust per unit 
area (blue shading) is highest at the north 
pole, but diminishes towards the equator 
owing to the curved stellar surface. A 
circular crust patch is then rotated (arrows) 
by plastic deformation, driven by stresses 
exerted by the twisted magnetic field below 
the crust. If this happens slowly and/or 
steadily, then its effect on the star’s spin- 
down history might not be obvious. 
Assuming that the vortices remain pinned 
to nuclei in the crust, once the patch has 
turned through a significant angle (180° 
in this illustration, much less in realistic 
applications’), the diminished number of 
vortices near the rotation axis will mean that 
the inner-crust superfluid is rotating more 
slowly than the rest of the star. A subsequent 
unpinning event would cause an anti-glitch. 


inference seems to be that some superfluid 
component within the star was rotating more 
slowly than the crust before the anti-glitch 
and/or was torqued-up by a sudden internal 
rearrangement. 

This is surprising, because the star in ques- 
tion, like all other solitary, magnetic neutron 
stars (including all radio pulsars), is spinning 
down monotonically, apart from occasional, 
ordinary glitches’. It is expected that internal 
superfluids can lag behind the general spin- 
down and act as faster-rotating ‘flywheels. But 
how could a stellar superfluid come to rotate 
more slowly than the crust and/or get abruptly 
spun up? 

The observed star is not a radio pulsar. It is 
a magnetar, an extremely magnetized neutron 
star with observable emissions powered by 
magnetic-field decay*. Magnetars are thought 
to be born spinning fast, with initial rotation 
periods of the order of several milliseconds. 
Their intense magnetism probably includes 
strongly wound-up interior-field components 
as a relic of this initial spin. The interior evolu- 
tion of these objects is dominated by diffusing, 
changing magnetic fields. These exert stresses 
capable of moving material around, especially 
within radially concentric shells inside the sta- 
bly stratified star — a possibility that may hold 
the key to the anti-glitch puzzle. 

Box 1 describes an idealized model for how 
an anti-glitch could arise in the inner-crust 
superfluid’. A more promising alternative 
explanation involves the core. In moderately 
magnetized zones of the outer core, type II 
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Alternatively, if crust vortices are 
unevenly distributed as a result of previous 
deformations or non-uniform pinning, then 
a fast, magnetically driven twist, perhaps 
involving fracturing around a crust patch, 
would cause an anti-glitch whenever the net 
concentration of vortices near the rotation 
axis goes up. R.C.D. 


Crust 
patch 


Rotation axis 


proton superconductivity requires that all 
magnetic flux is concentrated in quantum 
flux tubes — microscopic nodes in the super- 
conductor that are threaded by magnetic-field 
lines. These tubes resist passing through vor- 
tices in the interpenetrating neutron super- 
fluid where vortices and tubes intersect”. 
Thus, many flux tubes together can exert 
compelling forces on vortices. Elsewhere in 
the core, or where type I proton supercon- 
ductivity occurs, macroscopic flux structures 
with pronounced edges will also exert forces 
on vortices. As the star's magnetism evolves, 
any gradual tendency of these flux systems to 
drive vortices away from the rotation axis will 
spin down the core neutron superfluid, poten- 
tially setting up conditions for an anti-glitch. 
A sudden ‘breakthrough; or any instability by 
which vortices shift inward, would quickly spin 
up the superfluid and, as a result of angular- 
momentum conservation, spin down the rest 
of the star. 

Perhaps of relevance is the fact that recent 
studies'’”* have raised doubts about whether 
the inner-crust neutron superfluid is massive 
enough to account for ordinary radio-pulsar 
glitches. Therefore, some astrophysicists are 
already considering sub-crust superfluids as 
reservoirs for glitch angular momentum. How- 
ever this issue is resolved, the anti-glitch will 
probably provide insight into the interiors of 
neutron stars and help to illuminate the strange 
life histories of magnetars. m 
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Rear view of an enzyme 


The enzyme Ubc9 mediates attachment of the small modifier protein SUMO to 
target proteins. It emerges that for optimal functioning — and for proper meiotic 
cell division — Ubc9 itself must be modified by SUMO. 


MARY DASSO 


( "rte modification can change a 
protein’s fate and behaviour. One such 
modification is SUMOylation, which 

involves covalent attachment of the small pro- 

tein SUMO to target proteins. SUMOylated 
proteins can carry individual SUMO mol- 
ecules or chains, with each modification 
resulting in a different fate. Typically, the addi- 
tion of SUMO chains through the process of 
polySUMOylation marks the target for fur- 
ther tagging with chains of the SUMO-related 
protein ubiquitin and so for degradation’. 

Writing in Molecular Cell, Klug et al.” show that 

enhanced polySUMOylation is essential for 

meiotic cell division. The authors also describe 
how the balance is tipped between the addition 
of a single SUMO and a SUMO chain*. 

The attachment of SUMO to other proteins 
involves the sequential action of El and E2 
enzymes’. The El enzyme Aos1/Uba2 binds 
to and activates SUMO, before establishing a 
thioester linkage between SUMO and the E2 
enzyme Ubc9. This latter enzyme catalyses the 
formation of a stable isopeptide bond between 
the carboxy terminus of SUMO and lysine 
amino-acid residues in the target protein. 
This step frequently involves a third enzyme, 
E3 ligase. 

Amino acids adjacent to the target protein's 
acceptor lysine are often important for Ubc9 
activity, particularly when E3 enzymes are not 
involved. In fact, many target proteins carry 
a ‘consensus’ sequence that is preferentially 
used for SUMOylation in this situation. There 
are several such consensus sequences in the 
flexible amino-terminal domain of the SUMO 
protein found in the budding yeast Saccharo- 
myces cerevisiae, and the lysines in them act 


*This News &Views article was published online 
on 22 May 2013. 
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as the main sites for building SUMO chains’. 
Klug et al. find that, remarkably, the activity 
of S. cerevisiae Ubc9 is controlled by its own 
SUMOylation. Previous work*” had shown 
that yeast Ubc9 can be SUMOylated at two 
lysine residues near its own carboxy terminus, 
and that these modifications negatively regu- 
late the ability of Ubc9 to conjugate SUMO 
to target proteins. The present paper demon- 
strates that although such SUMOylated Ubc9 
(Ubc9*SUMO) is catalytically inactive as an 
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E2 enzyme, it nevertheless promotes poly- 
SUMOrylation. This may seem paradoxical, 
but Klug and colleagues show that enhanced 
polySUMOylation reflects the ability of 
Ubc9*SUMO to act as a scaffold (Fig. 1). 

An earlier study has shown’ that Ubc9 can 
interact non-covalently with SUMO ata sur- 
face that is both spatially and functionally 
distinct from its catalytic site. Notably, this 
rear surface must be intact for Ubc9 to medi- 
ate polySUMOylation efficiently. Klug and 
co-authors show that, in this Ubc9*SUMO 
interaction, the SUMO moiety binds to the 
rear surface of an active Ubc9 (which the 
authors call the enzyme’s backside), which is 
linked to another SUMO through a thioester 
bond. At the same time, the catalytic site of 
Ubc9*SUMO binds the consensus motifs of 
the thioester-linked SUMO, to place it in an 
optimal configuration for isopeptide-bond 
formation with the next SUMO protein as 
the latter is recruited by the catalytically 
active Ucb9. 

This intricate mechanism shares some fea- 
tures with mechanisms reported earlier, in 
which E2 and E2-like proteins function not as 
enzymes but as scaffolds during the conjuga- 
tion of target proteins. For example, one type of 
ubiquitin chain is assembled by an E2 enzyme 
in association with an inactive E2-like acces- 
sory protein’. The accessory protein binds an 
incoming ubiquitin so that the appropriate 
lysine residue of this ubiquitin is correctly ori- 
ented with respect to the catalytic site of the 
active E2 enzyme, ensuring that this lysine is 
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Figure 1 | Forging a link in the SUMO chain. a, Attachment of the carboxy terminus ofa SUMO protein 
to the catalytic site of Ubc9 through a thioester bond potentiates enzymatic activity of Ubc9. By contrast, 
SUMO conjugation close to the C terminus of Ubc9 through an isopeptide bond (Ubc9*SUMO) blocks its 
enzymatic activity. Klug et al.’ find that Ubc9*SUMO promotes chain formation by acting as a scaffold. The 
SUMO moiety of Ubc9*SUMO binds to the rear of the active Ubc9 (b). The catalytic site of Ubc9*SUMO 
then recognizes a consensus sequence at the amino terminus of the SUMO attached to the active Ubc9, 
placing this SUMO in an optimal configuration for transfer to a target protein (c). The active Ubc9 
recognizes an incoming SUMO (d) and catalyses formation of an isopeptide bond between the thioester- 
linked SUMO anda lysine with the N terminus of the incoming SUMO (e). Chain release follows (f). 
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used to form the next link of the chain. 
Additionally, mammalian Ubc9 forms an 
extremely stable complex with RanBP2, a 
nuclear-pore protein®. This complex acts as 
a composite E3 enzyme: the RanBP2-bound 
Ubc9 has a structural role, with a second Ubc9 
catalysing isopeptide-bond formation between 
SUMO and its target protein. Intriguingly, the 
rear surface of Ubc9 that is stably bound to 
RanBP2 does not seem to be exposed, and 
so the orientation of the two Ubc9 molecules 
associated with RanBP2 probably differs from 
the arrangement of paired Ubc9 molecules 
described by Klug and colleagues. Collectively, 
these mechanisms suggest that E2 enzymes can 
be adapted for a variety of structural roles in 
conjugation pathways, and they hint that many 
more variations on this theme will be found. 
Klug and co-authors also investigated the 
significance of Ubc9 SUMOylation during 
meiosis — the process by which a parental 
cell containing two sets of chromosomes 
eventually divides into four daughter cells, 
each carrying a single chromosome set. Yeast 
cells containing a Ubc9 mutant that could not 
be SUMOylated failed to form a synaptonemal 
complex, a meiotic structure that promotes 
both genetic exchange between chromosome 
pairs and correct chromosome segregation to 
daughter cells. In these mutants, individual 


Drug for an 


components of the synaptonemal complex 
showed altered concentrations as well as faulty 
incorporation into the complex. These find- 
ings build on earlier results’ to indicate that 
the formation of SUMO chains is necessary 
for the synaptonemal complex to function, 
although many details of the meiotic defect 
remain unclear. 

The same Ubc9 mutants grew normally 
under all the non-meiotic conditions that Klug 
et al. tested, and showed no increased sensitiv- 
ity to stress conditions. This may imply either 
that normal growth and stress responses do 
not require polySUMOylation levels as high as 
those needed for meiosis, or that other mecha- 
nisms for achieving polySUMOylation can be 
invoked and are sufficient outside meiosis. The 
researchers note that wild-type Ubc9 becomes 
increasingly SUMOylated during meiosis, 
and it will be interesting to discover how this 
‘autoSUMOylation is triggered to promote the 
polySUMOylation of downstream targets. 

Ubc9 is the sole E2 enzyme in the SUMO 
pathway and so can act as a key regulatory 
point under circumstances in which the overall 
pattern of SUMOylation fluctuates in response 
to varying cellular conditions". Although 
different consequences of autoSUMOylation 
on different domains of mammalian Ubc9 
have been described’, it would be useful to 


‘undruggable’ protein 


Scientists have long aimed to develop drugs against the cancer-associated 
protein KRAS, but without success. An approach that targets the oncoprotein’s 
cellular localization reignites lost enthusiasm. SEE LETTER P.638 


NICOLE M. BAKER & CHANNING J. DER 


uman RAS genes have two claims to 
H notoriety. First, they make up the most 
frequently mutated oncogene family 
in human cancer, having a prevalence of one in 
every three cases’. Second, despite more than 
three decades of intensive effort, no effective 
pharmacological inhibitor of the RAS onco- 
protein has reached the clinic. So it is exciting 
that, on page 638 of this issue, Zimmermann 
et al.” report* the identification and characteri- 
zation ofa small-molecule inhibitor that inter- 
feres with the localization of KRAS — the RAS 
isoform most commonly mutated in human 
cancers — to the plasma membrane surround- 
ing cells’. 
Following their synthesis in the cytoplasm, 


*This article and the paper under discussion? were 
published online on 22 May 2013. 


RAS proteins are initially inactive’. They then 
undergo a series of rapid post-translational 
modifications that ensure their association 
with the inner leaflet of the plasma membrane, 
where these proteins exert their normal, as well 
as their cancer-associated, signalling activity. 
Therefore, most efforts aimed at anti-RAS drug 
discovery have involved indirect approaches 
to block the activities of proteins that either 
promote plasma-membrane association of 
RAS or are components of its downstream 
signalling pathway. 

The key post-translational modification 
of RAS involves the addition of a 15-carbon 
farnesyl lipid tail in a reaction catalysed by 
the farnesyltransferase enzyme. This modi- 
fication facilitates RAS association with 
membranes and is essential for proper RAS 
localization and activity, having prompted 
intensive efforts in the 1990s to develop 
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know whether Klug and colleagues’ mecha- 
nism also applies in human cells. More gen- 
erally, the authors’ work suggests that there is 
much to learn about the feedback regulation 
of the SUMO pathway through SUMOylation 
and the many circumstances in which these 
mechanisms contribute to cellular function. m 
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farnesyltransferase inhibitors (FTIs). 

Despite promising results in preclinical 
studies, however, the results of clinical trials 
with FTIs were disappointing. The inhibitors 
blocked membrane association of the HRAS 
isoform, but lacked antitumour activity in 
cancers involving mutated KRAS (and NRAS). 
KRAS could still associate with the plasma 
membrane through an unexpected compensa- 
tory activity of the farnesyltransferase-related 
enzyme geranylgeranyltransferase-I, which 
modifies RAS with a geranylgeranyl, rather 
than a farnesyl, group. This discouraging out- 
come greatly dampened interest in targeting 
RAS — and, in particular, its membrane associ- 
ation — for cancer treatment. Instead, ongoing 
efforts have mainly focused on inhibitors 
of the RAF-MEK-ERK and the PI3K-AKT 
signalling cascades downstream of RAS. 

Zimmermann et al. describe an approach 
aimed at disrupting KRAS membrane associa- 
tion that warrants reassessment of the current 
strategies. The authors identify and charac- 
terize a small-molecule inhibitor of PDE6, 
a protein that can bind to and regulate the 
trafficking of RAS and RAS-related proteins 
to membrane compartments’ * (Box 1). Spe- 
cifically, PDE6 contains a deep, hydrophobic 
pocket capable of binding the lipid moiety of 
farnesylated proteins, in particular RAS. 

An earlier study® found that suppression 
of PDE6 levels disrupts RAS association with 
the plasma membrane and impairs the growth 
of RAS-mutant cancer cells. This finding 
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BOX1 


Inhibitors of the RAS signalling pathway 


After their synthesis, inactive RAS proteins 
undergo a rapid series of post-translational 
modifications at a CaaX tetrapeptide motif 
on their carboxy terminus (C denotes 

the amino acid cysteine; aa, 
two saturated hydrocarbon 
(aliphatic) residues; and X, a 
variable residue). The enzymes 
that mediate this modification 


include farnesyltransferase Plasma 
(FTase), which attaches a epratie 
15-carbon farnesyl lipid to RAS. 

Other modifications to this lipid Cytoplasm 


and to the CaaxX motif follow, 
before the PDE protein is 
attached to the farnesylated tail 
of RAS, thereby facilitating RAS 
accumulation at the plasma 
membrane. 

Pharmacological strategies 
to prevent activated RAS from 
reaching the plasma membrane 
include farnesyltransferase 
inhibitors such as lonafarnib 
and tipifarnib. Both of 
these are considered to be 
ineffective against KRAS 


prompted Zimmermann et al. to perform a 
high-throughput screen to identify small mol- 
ecules that could block PDE6 association with 
the farnesylated tail of KRAS. After identifying 
several hits, they took a structure-based drug- 
design approach to develop their most promis- 
ing compound, designated deltarasin. 

The researchers’ fluorescence microscopy 
experiments validate deltarasin’s ability to 
block PDE5-KRAS interaction in live cells. 
Following addition of 5-micromolar deltara- 
sin to human KRAS-mutant pancreatic-cancer 
cell lines, PDE could no longer redistribute 
KRAS to the plasma membrane. Deltarasin 
also impaired the proliferative capacity of the 
pancreatic-cancer cell lines, which depends 
on signalling of mutant KRAS. Furthermore, 
it greatly reduced KRAS-dependent signalling 
events, such as phosphorylation of the ERK1 
and ERK2 proteins. When the authors assessed 
the effects of deltarasin in vivo, in a mouse 
model of pancreatic ductal adenocarcinoma, 
they observed a dose-dependent reduction in 
tumour growth. 

Recently, there has been a resurgence of inter- 
est in targeting RAS. This is due in part to the 
findings of cancer-genomics studies, which 
have reaffirmed KRAS mutations as the pre- 
dominant oncogenic abnormalities in several 
cancers, including pancreatic, lung and colo- 
rectal cancers. However, the task of blocking 
mutant KRAS function with anticancer drugs 
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alternative modification of RAS proteins 
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remains daunting, requiring identification of 
fresh strategies. Zimmermann and colleagues’ 
finding, that pharmacologically disrupting the 
ability of PDE6 to promote plasma-membrane 
association of KRAS impairs the growth of 
KRAS-mutant pancreatic tumour cells, points 
to a provocative and innovative approach that 
may succeed where FTIs failed (Box 1). 

Indeed, the efficacy of a PDES inhibitor will 
not be circumvented by alternative prenyla- 
tion mechanisms, such as geranylgeranyla- 
tion, that prevent FTIs from blocking KRAS 
(and NRAS) association with the plasma 
membrane. Nonetheless, as is the case with 
FTIs, inhibition of RAS regulation by PDES 
will probably have unforeseen consequences. 
PDES can interact with other farnesylated 
proteins, including farnesylated RAS-family 
proteins that act as tumour suppressors (for 
example, NOEY2, also called DiRAS3). Inhi- 
bition of such beneficial proteins may lead to 
toxic effects in normal cells. 

There are also conflicting observations 
regarding PDE6 selectivity for farnesylated 
and geranylgeranylated proteins’. If PDE6 is 
also required for the trafficking of geranyl- 
geranylated proteins (such as Rho proteins), 
then additional off-target effects may be seen. 

Another unresolved issue is exactly how 
dependent RAS proteins are on PDES for 
proper localization, because even in the 
absence of PDE6, KRAS can bind to cell 
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treating the genetic disorder progeria, and 

is showing highly promising results. Another 

agent, salirasib, is a farnesyl mimic that 

competes with active RAS for binding to 

the oncoprotein’s docking 

protein, galectin, on the 

plasma membrane, leading 

to degradation of active 

cytoplasmic RAS. Salirasib 

has shown promising results 

‘ in a clinical trial for pancreatic 
cancer’®. Zimmermann et al.2 

describe a new drug candidate, 

deltarasin, which inhibits PDEO 

binding to farnesylated RAS. 

In addition to these inhibitors 
of Ras association with the 
plasma membrane, more 
than 20 inhibitors that target 
the RAF—MEK-ERK signalling 
cascade downstream of RAS 
are under clinical evaluation. 
Of these, vemurafenib, a RAF 
inhibitor, has been approved 
for use in patients with BRAF- 
mutant metastatic melanoma, 
a skin cancer. N.M.B. & C.J.D. 


Cancer 


ranylgenaryl 


membranes*®. That KRAS deficiency is lethal, 
whereas PDE6 deficiency is not’, under- 
scores the fact that at least some KRAS func- 
tions are PDE6 independent. So, although 
Zimmermann and co-authors’ data are excit- 
ing, much remains to be learnt about PDES 
function and mutant-RAS dependency on it 
for proper subcellular localization. = 
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The Norway spruce genome sequence 
and conifer genome evolution 


Lists of authors and their affiliations appear at the end of the paper 


Conifers have dominated forests for more than 200 million years and are of huge ecological and economic importance. 
Here we present the draft assembly of the 20-gigabase genome of Norway spruce (Picea abies), the first available for any 
gymnosperm. The number of well-supported genes (28,354) is similar to the >100 times smaller genome of Arabidopsis 
thaliana, and there is no evidence of a recent whole-genome duplication in the gymnosperm lineage. Instead, the large 
genome size seems to result from the slow and steady accumulation of a diverse set of long-terminal repeat transposable 
elements, possibly owing to the lack of an efficient elimination mechanism. Comparative sequencing of Pinus sylvestris, 
Abies sibirica, Juniperus communis, Taxus baccata and Gnetum gnemon reveals that the transposable element diversity 
is shared among extant conifers. Expression of 24-nucleotide small RNAs, previously implicated in transposable element 
silencing, is tissue-specific and much lower than in other plants. We further identify numerous long (>10,000 base 
pairs) introns, gene-like fragments, uncharacterized long non-coding RNAs and short RNAs. This opens up new 


genomic avenues for conifer forestry and breeding. 


Gymnosperms are a group of land plants comprising the extant taxa, 
cycads, Ginkgo, gnetophytes and conifers. Gymnosperms first appeared 
more than 300 million years ago (Myr ago)', well before the angiosperm 
lineage separated from the stem group of extant gymnosperms’. The 
major radiation of conifer families occurred 250-65 Myr ago’, and 
during their evolution the morphology of conifers has changed rela- 
tively little. There are approximately 630 conifer species, representing 
about 70 currently recognized genera, which dominate many terrestrial 
ecosystems, especially in the Northern Hemisphere. Conifers also 
dominated both before and after the major mass extinction events at 
the end of the Permian and Cretaceous periods, around 250 and 65 Myr 
ago, respectively. Conifers are of immense ecological and economic 
value; coniferous forests cover enormous areas in the Northern Hemi- 
sphere, and conifers are keystone species in many other ecosystems. 
Conifers contribute a large fraction of terrestrial photosynthesis and 
biomass, and the cultural and economic values of conifers are also para- 
mount; early civilizations used conifers for firewood, tools and artefacts 
and today several national economies depend on commodities produced 
from conifers. However, despite their abundance and importance, our 
understanding of conifer genomes is limited. Most conifers have 12 
(2n = 24) chromosomes, probably reflecting the ancestral karyotype’, 
which are typically of similar size, each being roughly comparable to the 
size of the human genome, and containing high proportions of repetitive 
elements”®. The gene space of conifer genomes has not been well char- 
acterized, although several reports have suggested that gene families in 
conifers may be larger than their angiosperm counterparts’ and that 
conifer genomes contain numerous pseudogenes*. 

Because their genomes are among the largest—typically 20-30 
gigabases pairs (Gb)—of all organisms, genome-wide analyses of con- 
ifers are particularly challenging. Thus, no full genome sequence of a 
gymnosperm species is available at present, whereas 30 angiosperm 
and more basal plant genomes have been sequenced. However, size is 
not the only challenge to sequencing presented by conifer genomes. 
Conifers are typically outbreeding, produce wind-dispersed pollen, 
have very large effective population sizes, and their genomes are 
highly heterozygous, although their nucleotide substitution rates are 
lower than those of most angiosperms*’, perhaps owing to long life- 
span (decades to centuries). Furthermore, inbreeding depression 


negates the production of inbred lines that could facilitate genome 
assembly. 

The availability of conifer genome sequences would enable com- 
parative analyses of genome architecture and the evolution of key 
traits for seed plants, including flower or fruit development and life 
history (perennial versus annual), and help to determine how and why 
conifer genomes became so large. To address these issues and aid 
forest tree breeding, biodiversity and conservation studies by, for 
example, enabling the genome-wide design of genetic markers, we 
used data from massively parallel DNA sequencing to assemble a draft 
of the 20-Gb nuclear genome of Norway spruce (Picea abies (L.) 
Karst), one of the most widespread, ecologically and economically 
important plants in Europe. We analysed the protein-coding and 
non-coding fractions of the genome and compared them to the 
low-coverage draft genome assemblies of five other gymnosperms— 
Scots pine (P. sylvestris), Siberian fir (A. sibirica), juniper (J. commu- 
nis), yew (Taxus baccata) and Gnetum gnemon—to gain insight into 
conifer genome evolution. 


Sequencing and assembly 

We sequenced a 43-year-old root-grafted copy of the P. abies clone Z4006, 
which originated from a tree in Ragunda, central Sweden, collected in 
1959. Many copies of this clone are available in clone archives and seed 
orchards, and it has been extensively used in Swedish breeding programs. 
We estimated its genome size to be 19.6Gb (C= 20.02 + 0.95 pg 
(mean = s.d.); Supplementary Information 1.1), in accordance with previ- 
ous reports”. 

De novo sequencing and assembly of large, repeat-containing, het- 
erozygous genomes remains challenging. To assemble the P. abies 
genome, we developed a hierarchical strategy combining fosmid 
pools’’ with both haploid and diploid whole genome shotgun 
(WGS) data, and RNA sequencing (RNA-Seq) data’*"'* (Supplemen- 
tary Information 1.2-1.3). The resulting assembly (P.abies 1.0) included 
4.3 Gb in >10-kilobase (kb) scaffolds (Table 1), and we estimated that 
approximately 63% of protein-coding genes!’ were fully covered 
(>90% of their length), and 96% partially covered (>30% of their 
length) within single scaffolds (Supplementary Information 1.4). By 
mapping diploid reads to the P.abies 1.0 assembly, the single nucleotide 
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polymorphism (SNP) frequency was estimated to be 0.77% and the 
short insertion/deletion (indel) frequency to be 0.05% (Supplemen- 
tary Information 1.5). 

The chloroplast genome (124kb) revealed considerable structural 
variation within the genus Picea (Supplementary Information 1.6). 
The draft mitochondrial genome (>4Mb) was among the largest 
reported for plants and was rich in short open-reading frames (ORFs), 
which appeared to be gene remnants derived from repeat-driven mito- 
chondrial rearrangements’* (Supplementary Information 1.7). 


Presence of long introns and gene-like fragments 

We generated >1 billion RNA-Seq reads and used transcript assem- 
blies of these in combination with public expressed sequence tags 
(ESTs) and transcripts to perform ab initio prediction of protein- 
coding genes, which identified a high confidence set of 28,354 loci 
with >70% coverage by supporting evidence from the total set of 
70,968 predicted loci. A notable characteristic of the predicted gene 
structures was the presence of numerous long introns (Fig. 1b), with 
mean intron length being higher than in most available plant gen- 
omes, although similar to the repeat-rich genomes of Vitis vinifera 
and Zea mays’”'*. The longest intron in the high-confidence genes 
was 68kb (Supplementary Table 2.6), and 2,384 high-confidence 
genes contained 2,880 longer than 5-kb introns (20 of which we con- 
firmed by PCR amplification; Supplementary Information 2.14), 
2,679 of which contained a repeat, suggesting that repeat insertions 
account for intron expansion. By contrast, exon size was consistent 
among the species considered (Supplementary Information 2.6.3). 
Numerous genes (~30%) remained split across scaffolds owing to 
assembly fragmentation, and as such, the longest introns were not 
represented in the P.abies 1.0 assembly. Long introns (either indi- 
vidual or cumulative intron length) did not influence expression levels 
(Fig. 1c) and introns containing repeats have not contracted despite a 
lack of recent repeat activity (see below). 

Analysis of gene families in the high-confidence gene set and seven 
sequenced plant genomes (five angiosperms: Arabidopsis thaliana, 
Populus trichocarpa, Vitis vinifera, Oryza sativa and Zea mays, and 
two basal plants: Selaginella moellendorffii and Physcomitrella patens) 
identified 1,021 P. abies-specific gene families (Fig. la and Sup- 
plementary Information 2.8). P. abies-specific families included 
over-representation of Gene Ontology categories involved in DNA 
repair and methylation of DNA and chromatin (Supplementary 
Information 2.8). As for most draft genomes, these results probably 


Table 1 | Characteristics of the P. abies genome 


Genome 
Size (1n) 19.6 Gb 
Karyotype 2n=24 
GC content 37.9% 
High-copy repeat content* 
LTR_Gypsy/Copia/unclassified 35%/16%/7% 
LINE 1% 
DNA transposable element 1% 
Unclassified 10% 
Genes and gene-like fragmentst 24% 
Assembly 
Size in scaffolds >200 bp/>10 kb 12 Gb/4.3 Gb 
N50/NG50 4,869 bp/721 bp 
Annotation 
High confidence gene set 28,354 
Genes with >5-kb introns 8.4% 


312 bp/1,017 bp 
1 gene in 705 kb 


Avg. exon/intron size 
Avg. gene density 


Transposable element genes 284,587 
Non-coding loci 
IncRNA (unique/conserved) 13,031/9,686 
miRNA (de novo predicted) 2,419 


* Inferred from unassembled reads. ‘Including pseudogenes, excluding transposable elements. 
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overestimate gene numbers’ and will be refined as we further 
improve the genome assembly. 

A common mechanism leading to genome size expansion is the 
occurrence of a whole genome duplication (WGD) event. We calcu- 
lated the number of synonymous substitutions per synonymous site 
(K,) of paralogues within the high-confidence genes but found no 
evidence for any recent WGD; there was a clear, exponential decay 
in the number of retained paralogues with increasing K, values (Sup- 
plementary Information 2.9 and Supplementary Fig. 2.6). However, a 
population dynamics model that takes into account both small- and 
large-scale modes of gene duplication” suggested the presence of a 
small peak (around K, of 1.1), which, considering the slow substi- 
tution rate of conifers, might represent the ancient WGD predating 
the divergence of angiosperms and gymnosperms (350 Myr ago”’). 

Previous examinations of small genomic subsets indicated that 
conifer genomes contain numerous pseudogenes****”’. The gene-like 
fraction of the P.abies 1.0 assembly was identified by alignment of 
RNA-Seq reads and de novo assembled transcripts (Supplementary 
Information 2.10). Within this subset of the genome, loci with valid 
spliced alignments of de novo assembled transcripts or the presence of 
a high-confidence gene were also identified. The high-confidence 
gene set represented 27 Mb of protein-coding sequence, whereas 
72 Mb of regions were identified with a valid spliced alignment or a 
high-confidence gene. In stark contrast, 524 Mb of gene-like regions 
were identified by less stringent alignments. The presence of such a 
large gene-like fraction lacking predicted gene structures supports the 
presence of numerous pseudogenes. 

Recent ENCODE publications**** characterized numerous long 
non-coding RNA (IncRNA) loci in the human genome, but this class 
of RNA remains largely uncharacterized in plants. Using short-read 
de novo transcript assemblies, 13,031 spruce-specific and 9,686 con- 
served intergenic IncRNAs were identified (Supplementary Informa- 
tion 2.4.3). In common with the ENCODE results, P. abies IncRNA 
loci contained fewer exons, were shorter (Fig. 1c), and had more 
tissue-specific expression than protein-coding loci (Supplementary 
Fig. 2.8). 

There has been conflicting evidence about the presence of 24- 
nucleotide short RNAs (sRNAs) in gymnosperms**”, a class of 
sRNA that silence transposable elements by the establishment of 
DNA methylation”. Across 22 samples, we identified numerous 
24-nucleotide sRNAs, but these were highly specific to reproductive 
tissues, largely associated with repeats but present at substanti- 
ally lower levels than in angiosperms (Fig. 1d and Supplementary 
Fig. 2.10). By contrast, 21-nucleotide sRNAs were associated with 
genes, repeats and promoters/untranslated regions (UTRs) (Fig. 1d). 
De novo microRNA (miRNA) prediction identified 2,719 loci, includ- 
ing 20 known miRNA families, with target sites predicted within 
the high-confidence gene set for 1,378 of these (Supplementary 
Information 2.13). Furthermore, 55 known miRNA families had >5 
aligned sRNA reads and mature miRNAs, representing 49 known 
families aligned to the genome (Supplementary Information 2.13). 


Conifer genomes grew by insertion of LTR-RT elements 


We constructed a manually curated library of 1,773 repetitive 
sequences, approximately half of which could be assigned to known 
transposable element repeat families (Supplementary Information 
3.1-3.3). Long terminal repeat-retrotransposons (LTR-RTs) com- 
prised the most abundant fraction of transposable elements, with 
the Ty3/Gypsy superfamily being more abundant than the Ty1/ 
Copia superfamily (Fig. 2a and Table 1). We also identified and char- 
acterized transposable elements using 454 reads from randomly 
sheared genomic DNA in five other gymnosperms (P. sylvestris, A. 
sibirica, J. communis, T. baccata and G. gnemon) and, in all six species, 
LTR-RTs were the most abundant class (Fig. 2a, Supplementary 
Information 4.1 and Supplementary Table 3.1). 


©2013 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


@ @ b 
a SO PP 
Cs & & 
+/- Gene families ga Arabidopsis thaliana 8,440 1,780 27,407 + -4 
Populus trichocarpa 8,925 2,396 40,141 tL} ei 
Vitis vinifera 7,987 2,779 26,238 F {a} emg 
Oryza sativa 10,049 7,353 41,363 tL Ff | 
+676/-298 
; Zea mays 9,847 3,785 39,172 f {} aug 
538 
3 Gee Picea abies 6,615 1,837 28,354 b- sey 
——— 16,451 SIF le oi -Wilesiaeegiimm 6,010 947 18,384 F (1- =| 
Physcomitrella patens 7,607 6,770 28,090 tLT Ff aa 
S HFEF LO ECE HL 
v 9 Y 9 SO SO 
nN Vv kon oF & > 
10% longest intron length (bp) 
c 4 d 
2 
ZNO, 008 = Promoter/UTR 
— =CDS 
cS = Repeat 
ax . 
Su 4] ° 150,000 
a 2 4 IncRNA 
5 8 i 
=> 2 7 
= 
3 100,000 
2 ae 
—4 
oe 50,000 
c 2 
Ss 
3 03 gl 
o = 
= is 
7 16 om = —— 
18 


jo} 


10 100 1,000 
Cumulative intron size (bp) 


10,000 


Figure 1 | The gene-space and transcribed fraction of the P.abies 1.0 
assembly. a, Gene family loss and gain in eight sequenced plant genomes 
(Arabidopsis thaliana, Populus trichocarpa, Vitis vinifera, Oryza sativa, Zea 
mays, Picea abies, Selaginella moellendorffii and Physcomitella patens). Gene 
families were identified using Tribe MCL (inflation value 4), and the DOLLOP 
program from the PHYLIP package was used to determine the minimum gene 
set for ancestral nodes of the phylogenetic tree. We used plant genome 
annotations filtered to remove transposable elements. ‘Orphans’ refers to gene 
families containing only a single gene. Blue numbers indicate the number of 


To trace the history of transposable elements in vascular plants we 
inferred phylogenies of a domain of the reverse transcriptase genes of 
both Ty1/Copia and Ty3/Gypsy elements. The phylogenies revealed 
several diverse and ancient transposable element subfamilies, present in 
almost all of the examined conifer genera, whereas only a few subfamilies 
were expanded in the angiosperm genomes (Fig. 2b, c and Supplementary 
Information 3.11). Most internal clades with significant bootstrap support 
were consistently species-specific, indicating that most expansions of 
extant transposable element families occurred after divergence. Two 
species-specific amplification bursts were evident: a Ty1/Copia family in 
J. communis and a Ty3/Gypsy family in T. baccata. We used complete 
LTR-RTs from P. abies and P. glauca to investigate further the timing of 
conifer transposable element insertions*’ (Supplementary Information 
3.4-3.8). In contrast to a similar set of elements identified in Oryza sativa 
and O. glaberrima (Fig. 2d), we detected no evidence of recent activity 
(that is, less than 5 Myr ago) in P. abies. Instead, insertions seem to have 
occurred over several tens of millions of years (older insertions are more 
likely to escape detection). Analysis of 68 orthologous transposable ele- 
ment insertions in P. abies and P. glauca further supported this: 63 inser- 
tions apparently predated divergence, and only five occurred after the 
lineages separated 13-20 Myr ago (Supplementary Information 3.9). 


19 20 21 22 23 24 
Size (nt) 


100,000 


gene families. b, Boxplot representation of length distribution for the 10% 
longest introns in the same eight genomes. ¢, Scatter plots of cumulative intron 
length against log) expression calculated as fragments per kilobase per million 
mapped reads (FPKM) for high-confidence gene loci (top, coloured orange) 
and green for IncRNA loci (middle, shaded green). The bottom panel shows a 
histogram of cumulative intron size in the two sets of loci. d, Distribution of 
small (18-24-nucleotide (nt)) RNAs and their co-alignment-based colocation 
to genomic features (repeats, high-confidence genes and their promoter/ 
UTRs). CDS, coding sequence. 


We clustered LTRs of complete elements to identify transpos- 
able element families*’. More than 86% of the elements remained as 
singletons, indicating that LTR-RTs are quite divergent and that 
there are several low-abundance families. We searched three LTR- 
RT families for signatures of unequal intra-element recombination 
events in scaffolds >50kb and 20 complete fosmids*’. For families 
ALISEI, 3K05 and 4D08_5 we identified 21, 22 and 39 complete 
elements, and four, five and no solo LTRs, respectively (Supplemen- 
tary Information 3.10). Although this data set is limited, the analysis 
suggested that LTR-RT-related sequences might be removed less fre- 
quently by unequal recombination than in other plant genomes. The 
ratio of solo-LTRs to complete elements in P. abies is ~ 1:9, whereas in 
A. thaliana, rice and barley it is 1:1 (ref. 33), 0.6:1 (ref. 34) and 16:1 (for 
the abundant BARE 1 element*’), respectively. Taken together, these 
findings indicated that the extant set of transposable elements in P. abies 
accumulated slowly over tens or hundreds of millions years, mainly by 
the insertion of LTR-RT elements with limited transposable element 
removal. 

An analysis of introns across taxa provided further insight into the 
genome of the last common ancestor to the conifers. We identified 
orthologues of normal sized (50-300 bp) and long (1-20 kb) introns in 
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Figure 2 | Conifer genomes contain expansions of a diverse set of LTR-RTs. 
a, Distribution of different classes of transposable elements from six 
gymnosperm species. The figure is based on the total fraction of 
transposable elements (TE) identified and grouped into different classes from 
the different species. Genome sizes of the six species are given in circles and 
their phylogenetic relationship is shown, with tentative dating of divergence 
times (x-axis) based on 64 chloroplast genes over 39 species and five fossil 
calibration points. b, c, Heuristic neighbour-joining trees constructed from 
5,922 sequences similar to the Ty3/Gypsy (b) and 3,052 sequences similar to 
the Ty1/Copia (c) reverse transcriptase domain from nine plant species. The 
trees to the right have only sequences from P. abies and Z. mays coloured, 
whereas the grey dots are the uncoloured versions of the other species 
represented on the left. d, Distributions of insertion times calculated for LTR- 
RTs in Picea abies, Picea glauca and Oryza glaberrima/O. sativa, using mutation 
rates (per base per year) of 2.2 X 10 ° for the Picea spp. and 1.8 X 10° for O. 
glaberrima”. 
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Figure 3 | Intron sizes are conserved among gymnosperms. a, b, Intron size 
comparisons between P. abies, P. sylvestris (a) and G. gnemon (b), respectively. 
Orthologues of introns that were categorised as short (50-300 bp) or long 
(1-20 kb) in P. abies were identified in P. sylvestris and G. gnemon, and the 
corresponding intron size was scored. 


spruce within draft genome assemblies of P. sylvestris and G. gnemon 
(Supplementary Information 4.2). Introns identified as orthologous to 
along intron in P. abies were also atypically long (Fig. 3a, b), suggesting 
that intron expansions started early in the history of conifers. 


The evolution of important conifer traits 

Two major differences between angiosperms and gymnosperms are 
their contrasting reproductive development and the development 
of water-conducting xylem cells. We therefore manually identified 
P. abies loci homologous to genes known to be centrally involved in 
these processes in angiosperms. 

In angiosperms, homologues of the A. thaliana phosphatidyletha- 
nolamine-binding protein (PEBP) FLOWERING LOCUS T (FT) are 
key activators of flowering. It has been suggested that gymnosperms 
lack orthologues of FT genes, instead containing a group of FT/TFL1- 
like genes that probably act as flowering repressors***’. We identified 
four putative FT/TFL1-like genes in the P.abies 1.0 genome that have 
not been previously described and confirmed that the genome does 
indeed lack FT-like genes (Supplementary Information 5.1). 

MADS-box genes are involved in controlling most aspects of angio- 
sperm development”™. A total of 278 sequences with clear homology to 
MADS boxes were identified in the P.abies 1.0 assembly (Supplemen- 
tary Information 5.2), 41 of which had transcript support. Type I and 
II MADS-box genes are distinguished in plants. Only 5% of the iden- 
tified MADS boxes were of type I (Supplementary Fig. 5.2.), the lowest 
percentage of potential type I genes recorded in any plant genome. 
Type II MADS-box genes are subdivided into about a dozen ancient 
clades. We observed remarkable expansions in the TM3-like (or 
SOC1-like), STMADS11-like and TM8-like gene clades in P. abies. 
Because members of these gene clades are involved in vegetative 
development and phase changes such as the floral transition in 
angiosperms”, we propose that the expansion of these gene clades 
has contributed to the evolution of developmental phase changes in 
gymnosperms. 

The xylem tissue of most gymnosperms comprises a single water- 
transporting cell type, tracheids. By contrast, the xylem tissue of angio- 
sperm species contains fibres, originating from tracheids that have to a 
large extent lost the capacity to conduct mass water flows, and vessels 
that have taken over the water-transport function in the stem. Formation 
of vessels is controlled by the VASCULAR NAC DOMAIN (VND) gene 
family, which has seven members in A. thaliana*®. We detected two 
VND genes in P. abies (Supplementary Information 5.3), suggesting that 
co-option and expansion of the VND gene family in vessel formation 
might have been important for angiosperm evolution. 
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A model for conifer genome evolution 


We propose the following model of conifer genome evolution. After 
the lineage that led to angiosperms had branched off and the most 
recent common ancestor of extant conifers had been established, the 12 
ancestral chromosomes expanded at a slow and steady rate due to the 
activity of a diverse set of Gypsy and Copia LTR transposable elements 
that are largely shared among extant conifers. The expansion started 
early and, in contrast to angiosperms genomes in which this has been 
counteracted by efficient recombination mechanisms” resulting in 
only smaller transposable element subsets remaining following recent 
bursts of activity"’”’, these elements have remained in the genome. We 
propose that mechanisms for transposable element removal (for 
example by unequal recombination) have been less active in conifers 
than in most other organisms”, and our data suggest that the insertion 
of transposable elements into genes gave rise to large introns, and 
(combined with other mechanisms) abundant pseudogenes. Each 
chromosome has grown to a similar size—perhaps limited by physical 
constraints on, for instance, chromosomal replication—with genes 
separated by large regions of transposable-element-rich, highly poly- 
morphic non-protein-coding regions with low recombination fre- 
quencies. The gradual increase in size, the lack of WGDs and a 
predominately out-crossing mating system have probably also buf- 
fered conifer genomes against chromosomal rearrangements (WGD 
reduces sensitivity to aneuploidy), thereby maintaining synteny over 
large phylogenetic distances™. 

Some angiospems, such as cereals, also have large genomes but it 
seems as if the “one way ticket towards genome obesity” that is 
barely recognizable in angiosperms prevails in conifers. The under- 
lying mechanism remains unclear, but the low frequency of 24- 
nucleotide sRNAs, their role in methylation of repeats and their 
restriction to reproductive tissues may have influenced the process. 
However, considering the effect of methylation patterns on recom- 
bination rates** and the fact that 24-nucleotide sRNAs trigger methy- 
lation, such low recombination frequencies would more likely result 
from hypermethylation’’. A state of ‘genome paralysis’ could poten- 
tially have been triggered once an obesity threshold was reached. In 
the angiosperm lineage, the occurrence of a number of WGDs pro- 
bably increased diversification potential, allowing morphological 
innovation (for example, the origin of flowers and fruits) and faci- 
litating speciation****’. By contrast, the conserved genome structure 
resulting from the paucity of genome rearrangements and lack of 
WGDs in conifers probably limited the evolution of reproductive 
barriers (resulting in relatively low rates of speciation), and may 
explain the high degrees of conservation through time and low mor- 
phological diversity. Nevertheless, these processes do not seem to 
affect fitness as conifers dominate many ecosystems, probably because 
they contain high degrees of standing genetic variation, allowing them 
to occupy very wide ecological niches in climatic regions where other 
plant species are less competitive. The future availability of additional 
gymnosperm genome sequences will allow further exploration of the 
unique processes that have driven their evolution and facilitate 
improvement of this important species. 


METHODS SUMMARY 


We shotgun-sequenced 450 fosmid pools containing around 100-6,000 fosmids 
per pool (Supplementary Table 1.4). Each fosmid pool was assembled and scaf- 
folded individually. Fosmid pool scaffolds larger than 1 kb (~6.7 Gb in total) were 
merged'’* (Supplementary Information 1.3) with a 38x haploid WGS assembly 
(~9.8 Gb in total, derived from ~600 ng of DNA extracted from a single mega- 
gametophyte). We subsequently performed scaffolding’ using WGS libraries of 
five different insert sizes (0.3, 0.65, 2.4, 4.4 and 10.4kb) from diploid tissue. We 
further increased assembly contiguity of protein-coding regions by scaffolding 
using a set of ~38 million unassembled (after digital normalization) RNA-Seq 
read-pairs generated from 22 samples (Supplementary Information 1.3). 

Ab initio prediction of protein-coding genes was performed using ESTs from 
numerous conifer species, our own short-read de novo transcript assemblies and 
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proteins from other plant species as supporting evidence (Supplementary 
Information 2.6). Predicted loci were used to perform gene family analysis and 
to examine the K, substitution rates of paralogues to identify evidence for a recent 
WGD event. De novo transcript assemblies were used to identify IncRNA, and 
sRNA sequencing was performed and used for de novo miRNA prediction. 

Repeated sequences were identified de novo using 454 reads longer than 700 bp 
generated from randomly sheared genomic DNA. Candidates were characterized 
using similarity searches at the nucleotide and amino acid level against public and 
custom collections of plant transposable element sequences. Complete LTR-RTs 
were identified using a combination of de novo searches and manual inspection. 

WGS assemblies from shallow sequencing (3.8-12.5X) of P. sylvestris, A. sibir- 
ica, J. communis, T. baccata and G. gnemon were produced using the CLC Bio de 
novo assembler. 

For website and accession number information, see Supplementary Information 6. 
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The importance of mixed selectivity in 
complex cognitive tasks 


Mattia Rigotti?°, Omri Barak't, Melissa R. Warden*”, Xiao-Jing Wang”°, Nathaniel D. Daw*®, Earl K. Miller* & Stefano Fusi' 


Single-neuron activity in the prefrontal cortex (PFC) is tuned to mixtures of multiple task-related aspects. Such mixed 
selectivity is highly heterogeneous, seemingly disordered and therefore difficult to interpret. We analysed the neural 
activity recorded in monkeys during an object sequence memory task to identify a role of mixed selectivity in subserving 
the cognitive functions ascribed to the PFC. We show that mixed selectivity neurons encode distributed information 
about all task-relevant aspects. Each aspect can be decoded from the population of neurons even when single-cell 
selectivity to that aspect is eliminated. Moreover, mixed selectivity offers a significant computational advantage over 
specialized responses in terms of the repertoire of input-output functions implementable by readout neurons. This 
advantage originates from the highly diverse nonlinear selectivity to mixtures of task-relevant variables, a signature 
of high-dimensional neural representations. Crucially, this dimensionality is predictive of animal behaviour as it 
collapses in error trials. Our findings recommend a shift of focus for future studies from neurons that have easily 
interpretable response tuning to the widely observed, but rarely analysed, mixed selectivity neurons. 


Neurophysiology experiments in behaving animals are often analysed 
and modelled with a reverse engineering perspective, with the more or 
less explicit intention to identify highly specialized components with 
distinct functional roles in the behaviour under study. Physiologists 
often find the components they are looking for, contributing to the 
understanding of the functions and the underlying mechanisms of 
various brain areas, but they are also bewildered by numerous obser- 
vations that are difficult to interpret. Many cells, especially in higher- 
order brain structures like the prefrontal cortex (PFC), often have 
complex and diverse response properties that are not organized ana- 
tomically, and that simultaneously reflect different parameters. These 
neurons are said to have mixed selectivity to multiple aspects of the 
task. For instance, in rule-based sensory-motor mapping tasks (such 
as the Wisconsin card sorting test), the response of a PFC cell may be 
correlated with parameters of the sensory stimuli, task rule, motor 
response or any combination of these’*. The predominance of these 
mixed selectivity neurons seems to be a hallmark of PFC and other 
brain structures involved in cognition. Understanding such neural 
representations has been a major conceptual challenge in the field. 

To characterize the statistics and understand the functional role of 
mixed selectivity, we analysed neural activity recorded in the PFC of 
monkeys during a sequence memory task’. Motivated by recent theor- 
etical advances in understanding how machine learning principles 
play out in the functioning of neuronal circuits*”°, we devised a new 
analysis of the recorded population activity. This analysis revealed that 
the observed mixed selectivity can be naturally understood as a sig- 
nature of the information-encoding strategy of state-of-the-art classi- 
fiers like support vector machines"’. Specifically we found that (1) the 
populations of recorded neurons encode distributed information that 
is not contained in classical selectivity to individual task aspects, (2) the 
recorded neural representations are high-dimensional, and (3) the 
dimensionality of the recorded neural representations predicts beha- 
vioural performance. 


Dimensionality and mixed selectivity 


The dimensionality of a neural representation in a given time bin is a 
property ofa set of vectors, each of which represents the firing rates of 
N recorded neurons in one experimental condition. The pattern of 
activity encoded in each such vector can be thought of as a point in an 
N-dimensional space. Over a set of such points, the dimensionality we 
refer to is defined as the minimal number of coordinate axes that are 
needed to specify the position of all points (Supplementary Methods 
M.1). For example, if all points are on a line, then their dimensionality 
is one, as one appropriately aligned axis is sufficient to determine their 
position. The dimensionality of the neural representations recorded 
during an experiment is then the dimensionality generated by the 
patterns of activity observed in each of the different experimental 
conditions (for example, all combinations of sensory stimuli and 
behavioural responses). 

High-dimensional neural representations have the desirable property 
of allowing simple readouts such as linear classifiers to implement a large 
set of input-output relations. Model circuits that rely on such high- 
dimensional representations can generate very rich dynamics and solve 
complex tasks*"°, and this same property is exploited in contemporary 
machine learning techniques such as support vector machines. 

This dimensionality is related to the mixed selectivity of neuronal 
responses. This is because a set of neurons whose responses are select- 
ive only to individual task-relevant aspects, or even to linear sums of 
multiple aspects (linear mixed selectivity), can only generate low- 
dimensional representations. Higher dimensional representations 
can be produced by including neurons whose responses cannot be 
explained as a linear sum of aspect-related responses, that is, neurons 
with nonlinear mixed selectivity (Fig. la, b). 


Mixed selectivity in a memory task 


Monkeys were trained to remember the identity and order of pre- 
sentation of two objects sequentially displayed on a screen. Their 
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Figure 1 | Low and high-dimensional neural representations, and mixed 
selectivity. a, Contour plots of the responses (spikes per s) of four hypothetical 
neurons to two continuous parameters that characterize two task-relevant 
aspects (a,b, varying between 0 and 1) corresponding to relevant stimulus 
features (for example, contrast and orientation). Neurons 1,2 are pure 
selectivity neurons, selective to individual parameters (a and b, respectively). 
Neuron 3 is a linear mixed selectivity neuron: its response is a linear 
combination of the responses to parameters a and b. Neuron 4 is a nonlinear 
mixed selectivity neuron: its response cannot be explained by a linear 
superposition of responses to the individual parameters. The green circles 
indicate the responses to three sensory stimuli parameterized by three a,b 
combinations. b, The responses of the pure and linear mixed selectivity neurons 
from a in the space of activity patterns (the axes indicate the firing rates of the 
neurons) elicited by the three stimuli indicated by the green circles in a lie ona 
line, therefore spanning a low-dimensional space. c, As in b, with the third 
neuron being the nonlinear mixed selectivity Neuron 4 in a. The 
representations of the stimuli lie on a plane, no longer being confined on a line. 
This higher dimensionality has an important role when the activity is read out 
by linear classifiers, because they can only separate the input space into classes 
that are separable by a plane (in general by a hyper-plane). This limits the 
implementable classifications (See Supplementary Section S.1). For example, in 
b it is impossible for any linear classifier to respond to the darker central circle 
and not to the other two. But it is possible in c, for instance for a linear classifier 
corresponding to an appropriately positioned horizontal plane. 


memory was then tested after a delay (the two-object delay period) 
either through a recognition or through a recall task, which were inter- 
leaved in blocks of 100-150 trials (Fig. 2 and refs 3, 12 for more details). 

We analysed the activity of 237 lateral PFC neurons (area 46) 
recorded in two monkeys during the trial epochs that precede and 
include the two-object delay period. In these epochs, each trial was 
characterized by a condition defined by three relevant aspects: the 
identity of the two visual objects and the task type. The first cue 
was randomly selected from a predefined set of four objects. The 
second cue was randomly chosen among the three remaining objects. 
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Figure 2 | Behavioural task from ref. 3. a, Sample sequence: each trial began 
when the monkeys grasped a bar and achieved central fixation. A first sample 
object was followed by a brief delay (the one-object delay), then a second sample 
object (different from the first sample object), then another delay (the two- 
object delay). b, Recognition task: the sample sequence was followed by a test 
sequence, which was either a match to the sample sequence, in which case the 
monkeys were required to release the bar, or a non-match, in which case the 
monkeys were required to hold the bar until a matching sequence appeared. 
c, Recall task: the sample sequence was followed by an array of three objects that 
included the two sample objects. Monkeys were required to make a sequence of 
saccades in the correct order to the two sample objects. Recognition and recall 
task trials were interleaved in blocks of 100-150 trials. 


Finally, the task type was either recognition or recall. More generally, 
for other experimental protocols, every situation (condition) would be 
characterized by the values of a set of discrete or continuous variables. 
We refer to them as to the task-relevant aspects. 

The analysis of recorded single-neuron responses shows that the 
majority of neurons are selective to at least one of the three task- 
relevant aspects in one or more epochs’. A large proportion of neu- 
rons, moreover, show nonlinear mixed selectivity (Supplementary 
Section S.2). Figure 3a-d show two examples of nonlinear mixed 
selectivity neurons. Figure 3a shows a cell that is selective to a mixture 
of cue 1 identity and task type: it responds to object C when presented 
as a first cue, more strongly during the recognition task. The neuron of 
Fig. 3c, d is mostly selective to objects A and D when presented as 
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Figure 3 | Mixed selectivity in recorded single-cell activity and population 
decoding. a, Average firing rate of a sample neuron (Gaussian smoothing with 
100-ms window, shaded area indicates s.e.m.). Colours denote different 
combinations of task type and sample cues (condition), indicated in parenthesis 
(task type, first cue, second cue). The ‘?’ indicates that cue 2 identities were 
averaged over. This neuron preferentially responds to object C as first cue in 
task 1 blocks (recognition task). b, Peri-condition histogram (PCH): average 
firing rate in a 100-ms time bin (+s.e.m.) at the yellow arrow in a for different 
conditions. The response to object C as first cue is significantly different for the 
two task types (P < 0.05, two-sample t-test). c, d, Same as a, b, for a different 
neuron with preference for object A and D as second objects during task 2 trials 
(recall task). e-g, Comparison of population decoding accuracy for task type 
(e), cue 1 (f) and cue 2 (g) before (dashed) and after (solid) removing classical 
selectivity. Dashed lines: average trial-by-trial cross-validated decoding 
accuracy of the decoder reading out the firing rate of 237 neurons in different 
independent time bins. Curves represent the average decoding accuracy over 
1,000 partitions of the data into training and test set (shaded areas show 95% 
confidence intervals). Horizontal dashed lines indicate chance level. Solid lines: 
decoding accuracy after the removal of classical selectivity for 237 (bright) and 
1,000 resampled neurons (dark) (see Supplementary Methods M.6). 

e, Accuracy in decoding task type from neurons whose selectivity to task type 
was removed. The decoding accuracy is initially at chance level, but steadily 
grows above chance level as the complexity of the task and the number of 
conditions increases. f, g, Analogous plots for the decoding accuracy of cue 1,2 
identity, when instead selectivity to cue 1,2 was removed. 


second stimuli, but only during the recall task and when they are 
preceded by object C. 


Information encoded by mixed selectivity 


A neuron is conventionally said to be selective to a task-relevant 
aspect if it responds differentially to the values of the parameters 
characterizing that aspect. A neuron is, for instance, selective to task 
type, if its average responses in recall and recognition task trials are 
significantly different. The operation of averaging over all conditions 
corresponding to a particular task type (for each task type there are 12 
possible combinations of the sample visual objects) may, however, 
result in discarding important information. The responses in indi- 
vidual conditions could encode information about task type through 
the nonlinear interactions between the cue and the task type aspects, 
which manifest themselves as nonlinear mixed selectivity. This sug- 
gests that if nonlinear mixed selectivity is sufficiently diverse across 
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neurons, the information about task type could be extracted from the 
covariance between neuronal responses across different conditions, 
even when individual neurons are not ‘classically’ selective to task type 
(that is, the average responses to recall and recognition tasks are not 
significantly different). Information could in other words be distrib- 
uted across the neural population, even when it is not present in 
individual cells (see also refs 13, 14 for recent discussions on distrib- 
uted codes.) 

To show that this is indeed the case, we manipulated the data to 
remove the classical selectivity to a given task aspect and then tested 
whether that task aspect could still be decoded from the population. 
Classical selectivity to a given task-relevant aspect is removed from 
every recorded neuron by adding noise that equalizes average res- 
ponses, preserving the differences between the individual conditions 
(Supplementary Methods M.3). In Fig. 3e-g we compared the cross- 
validated trial-by-trial population-decoding accuracy before and after 
removing classical selectivity. Neurons that were not recorded simul- 
taneously were combined as pseudo-simultaneous population activity 
patterns as explained in Supplementary Methods M.5. The temporal 
correlations that were neglected with this procedure do not seem to 
appreciably affect the decoding accuracy (Supplementary Section S.4). 

Before removing classical selectivity, the maximal cross-validated 
decoding accuracy peaks close to 100% for all task-relevant aspects 
(Fig. 3e). Both the identity and the temporal order of the visual objects 
could also be decoded (see Supplementary Section S.6). Crucially, all 
task-relevant aspects can be decoded even when classical selectivity is 
removed. The first panel of Fig. 3e shows the accuracy of decoding 
task type from the intact population and after removing classical 
selectivity to task type from all neurons. Note that removing classical 
selectivity causes a larger drop of decoding accuracy in the early 
epochs of the trial. As the trial progresses and more visual cues are 
memorized (that is, the task becomes more complex), the accuracy 
progressively increases towards the values of the intact population. 
Moreover, the decoding accuracy increases as the number of neurons 
read out by the decoder increases. We estimated the decoding accu- 
racy for larger neural populations by resampling the recorded neurons 
and randomly relabelling the identities of the visual objects, so as to 
obtain responses whose activity have the same statistics as the 
recorded ones. For example, a new neuron could be obtained by 
assigning the activity of a recorded neuron in response to objects A, 
B, C, D to the trials in which the objects were B, D, A, C (see 
Supplementary Methods M.3 and M.6 for more details). Similar 
results hold after removing the classical selectivity to cue 1 and cue 
2 (Fig. 3f, g), or when we removed the classical selectivity by sub- 
tracting from the neural activity the linear mixed selectivity compon- 
ent (Supplementary Section S.3). 


Neural representations are high-dimensional 


To verify that the observed nonlinear mixed selectivity and the divers- 
ity of the neural responses are a signature of the high-dimensionality 
of the neural representations, we set out to quantify the dimension- 
ality of the recorded activity. As this is notoriously difficult in the 
presence of noise (see Supplementary Section S.10 and ref. 15), we 
adopted a novel strategy that exploits the relation between dimension- 
ality and the performance of a linear classifier reading out the neural 
activity (Supplementary Section S.7). Our method relies on the obser- 
vation that the number of binary classifications that can be imple- 
mented by a linear classifier grows exponentially with the number of 
dimensions of the neural representations of the patterns of activities 
to be classified (Supplementary Methods M.1). Hence dimensionality 
can be estimated by counting the number of binary classifications that 
can be implemented by a linear classifier. The exponential depend- 
ence on the dimensionality implies that the number of implementable 
classifications can vary over several orders of magnitude, allowing 
for a huge computational advantage of high-dimensional over low- 
dimensional representations. 
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In general, a binary linear classifier reads out the activity of the 
recorded neurons and generates an output that can be either 0 or 1 
depending on whether the sum of its weighted inputs is below or 
above a threshold. The set of weights determines the specific clas- 
sification that is implemented. In our analysis, the classifier is trained 
to consistently generate the same output for all recorded trials cor- 
responding to the same experimental condition. The output can be 
different for the c different conditions (here, c equals 24 possible 
combinations of values for the three task aspects), for a total of 2° 
possible binary classifications or, equivalently, mappings from the c 
conditions to the binary outputs. Among all possible binary classifi- 
cations we estimated through cross-validation how many are imple- 
mentable by a linear classifier of the recorded patterns of activity 
(Supplementary Methods M.7). The total number of implementable 
binary classifications N, is related to the estimated dimensionality d of 
the representations through the expression d= log, N,, when the 
number of inputs is sufficiently large (Supplementary Methods M.1 
and Supplementary Section S.7). Accordingly, the dimensionality is 
bounded by the total number of distinct conditions c. 

Figure 4 shows both N, and d as a function of the number of 
neurons N read out by the classifier for two different neural repre- 
sentations. The first neural representation is given by the recorded 
PFC neurons. For values of N larger than the number of recorded cells, 
we computed the performance by introducing additional resampled 
neurons as previously described for the decoding analysis 
(Supplementary Methods M.6), after verifying that this does not 
introduce additional artificial dimensions to the neural representa- 
tions (Supplementary Section S.8). The second neural representation 
is generated from simulated pure selectivity neurons that encode only 
one aspect of the task at a time (Supplementary Methods M.4). We 
computed N, for these neural representations during two task epochs: 
the one-object (Fig. 4a) and the two-object delay (Fig. 4b). For both 
epochs, N. grows with the size of the neural population N and it 
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Figure 4 | The recorded neural representations are high-dimensional. 
Number of implementable binary classifications, N,, (left ordinate, on 
logarithmic scale) and dimensionality of the inputs (right ordinate, linear scale) 
for varying number of neurons of the population read out by a linear classifier. 
a, The black trace represents the number N, of implementable binary 
classifications of the vectors of recorded mean firing rates in the 800-ms bin in 
the middle of the one-object delay period. In this epoch a trial is defined by one 
of c = 8 different conditions, corresponding to all the combinations of task type 
and cue 1 objects. N, reaches the value that corresponds to the maximal 
dimensionality d= 8 (indicated by the dashed line). The grey line shows N, 
when the neural representations contain only the responses of artificially 
generated pure selectivity neurons with a noise level matching that of the data 
(See Supplementary Methods M.4). b, Same plot computed over the 800-ms bin 
in the middle of the two-object delay period. The advantage of the recorded 
representations over the pure selectivity neurons is huge. For the recorded data 
(black line) N, reaches 274, the value that corresponds to the maximal 
dimensionality d = 24, given by all possible combinations of cue 1 object, cue 2 
object and task type are 24 (dashed line). On the other hand, representations 
based on pure selectivity (grey line) generate less than 8 dimensions. Error bars 
are 95% confidence bounds estimated as detailed in Supplementary Methods 
M.7. See Supplementary Section S.20 for this analysis during the test epochs. 
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saturates near the value that corresponds to the maximal dimension- 
ality. The asymptotic value of N, is always larger for the recorded 
representations than for the pure selectivity representations. The dif- 
ference is several orders of magnitude for the two-object delay. 

The ability to implement such a large number of classifications is 
due to the diversity of nonlinear mixed selectivity responses, which 
often results in seemingly disordered representations and response 
properties that are not easily interpretable. However, it is important 
to note that high-dimensional representations could also originate 
from more ‘orderly’ responses, in which each neuron behaves as a 
“grandmother cell’ that responds only to a single experimental con- 
dition—in our case to one out of 24. We ruled out this scenario in our 
data by verifying that PFC representations are rather dense (Sup- 
plementary Section S.11) and that the sparsest responses are not a 
major contributor to the observed high dimensionality (Supplemen- 
tary Section S.19). 


Dimensionality predicts animal performance 


High-dimensional neural representations encode information that is 
nonlinearly mixed in a format that is suitable for local neural proces- 
sing. Therefore high dimensionality could be important for the dyna- 
mics of neural circuits that eventually generate behaviour*’? (see also 
Supplementary Section S.1 for an illustration of the computational 
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Figure 5 | The dimensionality of the neural representations predicts animal 
behaviour. a, Same plot as Fig. 4b, with the difference that the analysis is 
restricted to the recall task and the two curves represent the number of 
implementable binary classifications of the recorded activity in the case of 
correct (black) and error (grey) trials. For the correct trials the number of 
implementable classifications corresponds to a dimensionality that is close to 
maximal (d = 12, dashed line). In the error trials the dimensionality drops 
significantly. b, The identity of the two cues can still be decoded in the error 
trials: decoding accuracy as in Fig. 3 in the correct (continuous lines) and error 
trials (dashed lines) for the identity of cue 1 (green lines) and cue 2 (orange 
line). The correct cue identities are perfectly decoded also during error trials. 
The error bars (shaded areas) are computed as in Fig. 3e-g. c, d, Contribution of 
nonlinear and linear mixed selectivity to the collapse in dimensionality 
observed in the error trials. c, After removing the linear component of mixed 
selectivity from the response of each neuron, the dimensionality is estimated as 
in a. The dimensionality in the correct trials (black line) is still significantly 
higher than in the error trials (grey line). d, Same as in c, but after the nonlinear 
component of mixed selectivity is subtracted from each neuron. The two curves 
are not significantly different, indicating that the nonlinear component of 
mixed selectivity is responsible for the collapse in dimensionality. These 
analyses were carried out on a subset data set of 121 neurons that were recorded 
in as many correct trials as error trials during the recall task. a, c and d, Error 
bars are 95% confidence bounds estimated as detailed in Supplementary 
Methods M.7. 
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advantages of high-dimensional over low-dimensional representa- 
tions). If this is the case, we should observe a correlation between 
the dimensionality of the neural representations and the performance 
of the animal. In particular, the dimensionality should decrease in 
error trials. 

We tested this prediction by analysing the errors made by the 
monkeys in the recall task (there were too few errors in the recognition 
task). Figure 5a shows the number of implementable binary classifi- 
cations and the dimensionality as a function of the number of neurons 
in the input in the 800-ms time bin in the middle of the two-object 
delay period, as in Fig. 4b, but only during the recall task and sepa- 
rately for correct and error trials. As predicted, the dimensionality 
decreases in the error trials. We ruled out that the decrease could be 
explained by a difference in the number of trials (for the 121 neurons 
with enough error trials, the difference between the number of correct 
and error trials is less than 4%), a change in average firing rate 
(Supplementary Section S.13), in the variability of single neuron acti- 
vity (Supplementary Section S.14) or in the coding level of the activity 
(Supplementary Section S.15). 

Remarkably, the identity of the two visual cues could still be 
decoded with high accuracy in the error trials. Figure 5d shows that 
the cross-validation performances of the decoder of Fig. 3 trained on 
correct trials and tested on a hold-out set of correct (continuous line) 
and error trials (dashed lines) are indistinguishable, demonstrating 
that when the monkeys make a mistake, cue identities are encoded 
with the same strength as in the correct trials. We verified that this 
correspondence is not a ceiling effect due to the population decoder 
saturating at high performance (Supplementary Section S.16). 

These results indicate that the collapse in dimensionality observed 
in the error trials is not due to a failure in coding or remembering the 
sensory stimuli. Our hypothesis is that nonlinear mixed selectivity, 
which underlies high dimensionality, is important for the generation 
of the correct behavioural response. In the error trials this component 
of the selectivity is disrupted, leading to a collapse in dimensionality, 
which impairs the ability of downstream readout neurons to produce 
the correct response. 

To test this hypothesis, we quantified the contribution to the col- 
lapse in dimensionality of the nonlinear mixed selectivity component 
of the neuronal response. The nonlinear mixed selectivity component 
represents the signal that is not described by a linear mixing model 
(Supplementary Methods M.2). The collapse in dimensionality that 
predicts the errors of the animal is specifically due to a weakening of 
the nonlinear component of mixed selectivity. Indeed, the difference 
in dimensionality between correct and error trials still remains sig- 
nificant after subtracting the linear component of mixed selectivity 
(Fig. 5c), whereas it is negligible when the nonlinear mixed selectivity 
component is removed (Fig. 5d). These results are confirmed by a 
principal component analysis (Supplementary Section S.17). 


Discussion 


We showed that the mixed selectivity that is commonly observed in 
PFC responses can be interpreted as a signature of high-dimensional 
neural representations. One advantage of high dimensionality is that 
information about all task-relevant aspects and their combinations is 
represented in a way that is easily accessible to linear classifiers, such 
as simple neuron models. The information is distributed across mul- 
tiple neurons in an ‘explicit’ format’® that allows a readout neuron to 
implement an arbitrary classification of its inputs. Previous studies 
have already shown that a linear readout is often sufficient to decode 
particular task aspects or to perform specific tasks (see for example 
refs 17, 18). Here, by showing that the neural representations are high- 
dimensional, we demonstrate that any binary choice task involving 
the 24 experimental conditions that we analysed could be performed 
by a linear readout. 

One of our main results is that the dimensionality of the neural 
representations collapses in error trials, indicating that nonlinear 
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mixed selectivity might be important for generating correct beha- 
vioural responses. It is tempting to speculate about the causes of this 
dimensionality collapse. Nonlinear mixed selectivity can change in a 
way that is compatible with our observations when neurons integrate 
multiple sources of information, which include those that are relevant 
for the task and those that are not under experimental control. The 
change in dimensionality may be caused by the excessive variability of 
sources that are not task-relevant. In other words, to perform cor- 
rectly, the brain has to mix nonlinearly the task-relevant sources of 
information in a way that is consistent across trials. This consistency 
requires to restrict the contribution of the other sources. This is sim- 
ilar to what has been observed in the premotor cortex, where firing 
rates tended to be less variable on trials in which the reaction time 
was shorter’’. A theoretical argument (Supplementary Section S.18) 
shows that neurons with a strong nonlinear mixed selectivity are more 
sensitive than pure selectivity neurons to the task-irrelevant sources of 
variability. Nonlinear mixed selectivity is most useful but also most 
fragile. Pure and linear mixed selectivity, which are more robust, make 
it possible to decode individually all task-relevant aspects even in the 
error trials, as observed here. 

Although high dimensionality is not strictly necessary for generat- 
ing rich dynamics and performing complex tasks, it is known to 
greatly simplify the design of local neural circuits’. Indeed, realizing 
a complex and rich dynamics is for some model circuits equivalent to 
solving a classification problem in which the network has to generate a 
particular output for each input. In these models this is typically 
realized by training a subset of neurons to respond in a specific way 
to an external input or to the internally generated activity. This is 
equivalent to classifying the activity of the input neurons for every 
time step. In many situations this activity is read out by downstream 
circuits. In others it is fed back to the neural circuit to affect its 
dynamics and hence the statistics of future inputs. Especially in the 
latter situations, the number of input-output functions or classifica- 
tions that must be implemented by each neuron can be significantly 
larger than the number of functions required to simply produce the 
observed final behavioural response, because the neurons are required 
to generate the proper output for every time step. For this reason, it is 
often necessary to expand the dimensionality of the neuronal repre- 
sentations of the external sensory input and the internal state. In 
recent models*"°, the dimensionality of the neuronal representations 
is expanded by mixing in a nonlinear way the different sources of 
information in a population of randomly connected neurons. The 
resulting neuronal representations are high-dimensional (see for 
example ref. 20), like those observed in PFC, and consistent with high 
dimensionality, the neurons show mixed selectivity which is diverse 
across time (that is, in different epochs of the trials) and space (that is, 
across different neurons). Random connectivity in small brain regions 
has been suggested on the basis of anatomical reconstructions”! and 
recently observed in the connections from the olfactory bulb to the 
olfactory cortex” (see also ref. 14 for a general discussion). 

We showed that the recorded mixed selectivity can be useful for the 
activity to be linearly read out. It is legitimate to ask whether these 
considerations would still be valid if we consider more complex non- 
linear readouts. For example, some of the transformations which 
increase the dimensionality of the neural representations could be 
implemented at the level of individual neurons by exploiting dendritic 
nonlinearities. Our results do not exclude the functional importance 
of such dendritic processes. They do, however, tend to argue against a 
scenario where all important nonlinear transformations are carried 
out at the level of single neurons, a situation where dimensionality 
expansion could happen in a ‘hidden way’, and the observable repre- 
sentations provided by the neuronal firing rates could therefore be 
low-dimensional. 

Finally, the particular form of redundancy inherited from high- 
dimensional representations allows neural circuits to flexibly and 
quickly adapt to execute new tasks, just as it allows them to implement 
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arbitrary binary classifications by modifying the weights of a readout 
neuron (using, for instance, a supervised procedure like the perceptron 
learning rule’’). In Supplementary Section S.9 we show an example of 
this flexibility by training a simulated neuron to perform a new virtual 
task based on the recorded activity. High dimensionality might there- 
fore be at the basis of the mechanisms underlying the remarkable 
adaptability of the neural coding observed in the PEC? and, as such, 
be an important element to answer fundamental questions that try to 
map cognitive to neurophysiological functions. 

In conclusion, the measured dimensionality of the neural represen- 
tations in PFC is high, and errors follow a collapse in dimensionality. 
This provides us with a motivation to shift the focus of attention from 
pure selectivity neurons, which are easily interpretable, to the widely 
observed but rarely analysed mixed selectivity neurons, especially in 
the complex task designs that are becoming progressively more 
accessible to investigation. 


METHODS SUMMARY 


The formal definitions of dimensionality and mixed selectivity are in Supplemen- 
tary Methods M.1 and M.2, respectively. The procedures for removing selectivity, 
decoding task-relevant aspects and resampling neurons used in Fig. 3 are 
explained in Supplementary Methods M.3, M.5 and M.6. The dimensionality 
estimate of Fig. 4 is detailed in Supplementary Methods M.7. The analysis of 
the linear and nonlinear components of Fig. 5 is in Supplementary Methods M.8. 
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An anti-glitch in a magnetar 


R.F. Archibald', V. M. Kaspil, C.-Y. Ng’, K.N. Gourgouliatos', D. Tsang’, P. Scholz', A. P. Beardmore’, N. Gehrels* & J. A. Kennea” 


Magnetars are neutron stars with X-ray and soft y-ray outbursts 
thought to be powered by intense internal magnetic fields’. Like 
conventional neutron stars in the form of radio pulsars, magnetars 
exhibit ‘glitches’ during which angular momentum is believed to be 
transferred between the solid outer crust and the superfluid com- 
ponent of the inner crust**. The several hundred observed glitches 
in radio pulsars*® and magnetars’ have involved a sudden spin-up 
(increase in the angular velocity) of the star, presumably because 
the interior superfluid was rotating faster than the crust. Here we 
report X-ray timing observations of the magnetar 1E2259+586 
(ref. 8), which exhibited a clear ‘anti-glitch’—a sudden spin-down. 
We show that this event, like some previous magnetar spin-up 
glitches’, was accompanied by multiple X-ray radiative changes 
and a significant spin-down rate change. Such behaviour is not 
predicted by models of neutron star spin-down and, if of internal 
origin, is suggestive of differential rotation in the magnetar, sup- 
porting the need for a rethinking of glitch theory for all neutron 
stars’), 

1E2259+586 is a magnetar with a rotation period of about 7 s, with 
a spin-inferred surface dipolar magnetic field strength of 5.9 X 10'°G. 
Over 16 years of monitoring with the Rossi X-ray Timing Explorer, 
1E2259+586 has shown a very stable spin-down rate and pulsed flux, 
with the exception of two spin-up glitches in 2002 (ref. 9) and 2007 
(ref. 12), and a small timing event in 2009 (ref. 12).The 2002 glitch was 
also accompanied by an increase in X-ray luminosity by a factor of 20 
(ref. 9) and X-ray bursts"’, neither of which was seen in the 2007 glitch. 


We began monitoring 1E2259+586 with NASA’s Swift X-ray 
Telescope“ in July 2011. Observations were made every 2-3 weeks, 
with typical exposure times of 4ks. From each observation, we 
obtained a pulse time-of-arrival (TOA) by folding the X-ray time series 
at the current pulse period and aligning this folded light curve with a 
high signal-to-noise template. 

We fitted the pulse TOAs to a long-term timing model that keeps 
track of every rotation of the neutron star. This model predicts when 
the pulses should arrive at Earth, taking into account the pulsar rota- 
tion as well as astrometric terms. We compared the observed TOAs 
with the model predictions, and obtained best-fit parameters by 77 
minimization, using the TEMPO2”° software package. Until the obser- 
vation on 14 April 2012 (modified Julian date, MJD 56,031.18), these 
TOAs were well fitted using only a frequency and first frequency 
derivative, as shown in Fig. 1. 

The subsequent data, however, clearly were not predicted by this 
simple model. TOAs starting on 28 April 2012 (MJD 56,045.01) 
showed an apparently instantaneous change of the frequency—which 
we dub an ‘anti-glitch’. On 21 April 2012 (MJD 56,038), consistent 
with the epoch of this sudden spin-down, a 36-ms hard X-ray burst 
was detected by the Fermi Gamma-ray Burst Monitor'®, with a fluence 
of about 6 X 10 *ergcm 7 in the 10-1,000 keV range. No untriggered 
bursts from the Gamma-ray Burst Monitor were seen within three 
days of the observed burst’®. Also, on 28 April 2012 (MJD 
56,045.01), coincident with the nearest post-anti-glitch observation, 
we detected an increase in the 2-10-keV flux by a factor of 2.00 + 0.09 
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Figure 1 | Timing and flux properties of 1E2259+586 around the 2012 
event. a, 1E 2259+586’s spin frequency as a function of time, determined by 
short-term fitting of (typically) five TOAs. The grey horizontal error bars 
indicate the ranges of dates used to fit the frequency, and the vertical error bars 
(generally smaller than the points) are standard 1o uncertainties. The red and 
blue solid lines in a represent the fits to the pulse TOAs, as displayed in Table 1, 
with red representing model 1, and blue model 2. b, Timing residuals 
(differences between the initial model and observed data) of 1E 2259+586 after 


fitting only for the pre-anti-glitch timing solution. The inset shows the same 
timing residuals, zooming in on the anti-glitch epoch. c, The absorbed 2-10- 
keV X-ray flux. The error bars indicate the 1o uncertainties, and the green line 
is the best-fit power-law decay curve with an index of -0.38 + 0.04. The dashed 
vertical lines running through all panels indicate the glitch epochs, the black 
line being the anti-glitch, and blue and red lines the second event in the models 
shown in Table 1. The timing residuals for these fits can be seen in the 
Supplementary Information. 
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(see Fig. 1). The 2-10-keV flux increase was also accompanied by a 
change in the hardness ratio, defined as the ratio of the 4-10 keV flux 
to the 2-4keV fluxes, from 0.10 + 0.02 to 0.18 + 0.02. This flux 
increase subsequently decayed, following a power-law model with 
a% = -0.38 + 0.04 (see Fig. 1). The flux increase was accompanied by 
a moderate change in the pulse profile: the addition of a sinusoid 
centred between the usual two peaks in the pulse profile. This modified 
pulse profile relaxed back to the usual shape on a timescale similar to 
that of the flux. We verified that this profile change did not affect the 
TOAs determined near the anti-glitch epoch. 

This remarkable spin-down event was immediately followed by an 
extended period of enhanced spin-down rate. This anti-glitch and 
spin-down rate change can be modelled well by an instantaneous 
change in the frequency and frequency derivative, followed by a second 
sudden event. We have found two possible timing models to describe 
the pulsar’s behaviour, described in full in Table 1. In the first, there is 
an instantaneous change in frequency and frequency derivative by 
Av = -4.5(6) X 10° °Hz (where Av/v = -3.1(4) X 10-7) and Aji= 
—2.7(2) X 107 '*Hzs~' on 18 April (MJD 56,035(2)). This enhanced 
spin-down episode ended with a second glitch, this time a spin-up 
event, of amplitude Av = 3.6(7) X 107° Hz (Av/v = 2.6(5) X 10°”) and 
Av = 2.6(2) X 10° *Hzs*. 

In the second model, the spin evolution can be described by two 
anti-glitches, instead of an anti-glitch/glitch pair. In this model, a 
change of Av = -9(1) X 10 °Hz (Av/v = -6.3(7) X 10”) and Av= 
—1,3(4) x 10° '*Hzs' occurred on 21 April (MJD 56,038(2)). This 
period ended with a second anti-glitch of amplitude Av = -6.8(8) x 
10 °Hz (Av/v = 4.8(5) X 10 7) and Av=1.1(4) X 10 *Hzs 1. 

The full timing parameters for both possible models are presented in 
Table 1. Note that neither model is preferred on statistical grounds; 
however, models involving a single initial anti-glitch and subsequent 
relaxation with no second impulsive event are ruled out to high con- 
fidence. We also note that no significant radiative, or profile, changes 
can be associated with either of the possible second impulsive events. 

In either model a sudden spin-down at the epoch of the Fermi burst 
is unambiguously required to model the observed TOAs properly. 
Although the amplitude of this anti-glitch in either model is not 
unusual, the fact that it is a sudden spin-down is remarkable. The 
net effect of this active period are changes to the spin frequency and 
its first derivative Av = -2.06(8) X 10 7 Hz (Av/v = -1.44(6) X 10°) 
and a Aj = —1.3(4)X 10 Hzs 1. 


Table 1 | Timing parameters for 1E 2259+586 


Parameter 


Value 


23 July 2011 to 1 January 2013 


Observation dates 


Dates (MJD) 55,765.829 to 56,293.332 
Epoch (MJD) 55,380.000 
Number of TOAs 51 
v(s}) 0.143, 285, 110(4) 
i(s~?) -9.80(9) x 10°15 
Model 1 
Glitch epoch 1 (MJD) 56,035(2) 
Ay (s+) -4.5(6) x 10°® 
Ai (s~?) -2.7(2) x 107-14 
Glitch epoch 2 (MJD) 56,125(2) 
Avo (s+) 3.6(7) x 10-8 
Aiz(s~?) 2.6(2) x 10°14 
Root-mean-square residuals (ms) 56.3 
y/v 45.4/44 
Model 2 
Glitch epoch 1 (MJD) 56,039(2) 
Ay, (s+) -9(1) x 10 ® 
Avy (s~?) -1.3(4) x 10°14 
Glitch epoch 2 (MJD) 56,090(3) 
Avo (s+) -6.8(8) x 10°® 
Avo (s~2) 1.1(4) x 10-4 
Root-mean-square residuals (ms) 515 
7/v 38.1/44 


Numbers in parentheses are TEMPO2-reported 1o uncertainties. 
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Sudden spin-down glitches have not previously been observation- 
ally demonstrated, though some magnetar events have been sugges- 
tive. A spin-down in magnetar SGR 1900+ 14 (ref. 17) occurred during 
an 80-day gap in the source monitoring, but could have been a gradual 
slowdown, as was also possible for the 2009 timing event in 
1E2259+586 (ref. 12). Net spin-downs have been seen in magnetar 
4U0142+61 (ref. 18) and in the high-magnetic-field-rotation- 
powered pulsar PSRJ1846-0258 (ref. 19) but were due to spin-up 
glitch over-recoveries on timescales of 17 and 127 days, respectively. 
If the 1E2259+586 event were due to a spin-up glitch and subsequent 
over-recovery, we place a 30 upper limit on the recovery decay time of 
3.9 days for a spin-up of size Av/v = 1 X 10° °. Even for an infinite- 
simally small spin-up glitch, the decay time was less than 6.6 days, far 
shorter than any previously observed magnetar recovery timescales. 

Following the detection of the anti-glitch, we looked for evidence of 
particle outflow, proposed as a possible mechanism for the apparent 
spin-down in SGR 1900+ 14 (ref. 20). We carried out radio imaging on 
21 August 2012 using the Expanded Very Large Array (New Mexico, 
USA) in the B-array configuration with a 240-min integration time. 
This yielded images with effective angular resolution of 1.2”. We per- 
formed standard flagging, calibration and imaging using the Common 
Astronomy Software Applications (CASA) package’'. No source was 
found at the position of 1E2259+586, and we place a 3a flux density 
limit of 7.2 Jy at 7 GHz for a point source. This is significantly lower 
than the previous upper limit of 50 Jy at 1.4 GHz (ref. 9). Ifa putative 
outflow were expanding at 0.7c (where c is the velocity of light in a 
vacuum), as was the case for a radio outflow from SGR 1806-20 (ref. 22) 
at the time of its outburst, we would expect a nebular radius of 4”. For 
this radius, we obtain a 3 flux density limit of 0.46 mJy. The limit is 
more stringent if the size is smaller, and reduces to 7.2 Jy if unresolved. 

In X-rays, we also detected no evidence for such outflow in a 10-ks 
Chandra High Resolution Camera (HRC)-I image taken on 21 August 
2012. Using simulations, we place an upper limit on X-ray flux from a 
putative outflow at 2% of the total 1-10-keV X-ray emission of the 
magnetar, for a 4” circular nebula with a Crab-like spectrum. 

There are two main possibilities for the origin of the anti-glitch: 
either an internal or external mechanism, as follows. An impulse-like 
angular-momentum transfer between regions of more slowly spinning 
superfluid and the crust could be the source of the anti-glitch”. A 
slower angular-momentum transfer to such a region or the decoupling 
of a significant amount of the moment of inertia of the star could 
account for the enhanced spin-down rate. The second event, either 
glitch or anti-glitch, can similarly be modelled by angular-momentum 
transfers from differentially rotating regions of the neutron star super- 
fluid. The radiatively quiet nature of the second event does not pose a 
problem for the internal model because many glitches are observed to 
be radiatively silent’. The behaviour indicated by an impulsive anti- 
glitch offers new evidence for possible significant internal structural 
changes and differential rotation in magnetars at glitch epochs. 

An external model such as an outflow along the open field lines of 
the magnetosphere*’**™, or a sudden twisting of the field lines**, could 
be the cause of the anomalous spin-down behaviour. However, in a 
wind model, the second timing event should also be accompanied by a 
radiative change, like the first one. If this behaviour was caused by 
twisting magnetic field lines, it should be followed by a gradual 
untwisting and a similar behaviour reflected in ¥ (ref. 26; see Sup- 
plementary Information). We suggest that this magnetar anti-glitch, 
X-ray outburst, and subsequent evolution indicate the need for a 
rethinking of glitch theory for all neutron stars'®"’. 
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Cloning of Dirac fermions in graphene superlattices 
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K. Geim!? 


Superlattices have attracted great interest because their use may 
make it possible to modify the spectra of two-dimensional electron 
systems and, ultimately, create materials with tailored electronic 
properties’ *. In previous studies (see, for example, refs 1-8), it 
proved difficult to realize superlattices with short periodicities 
and weak disorder, and most of their observed features could be 
explained in terms of cyclotron orbits commensurate with the 
superlattice’*. Evidence for the formation of superlattice mini- 
bands (forming a fractal spectrum known as Hofstadter’s butterfly’) 
has been limited to the observation of new low-field oscillations’ 
and an internal structure within Landau levels®*. Here we report 
transport properties of graphene placed on a boron nitride sub- 
strate and accurately aligned along its crystallographic directions. 
The substrate’s moiré potential'®”” acts as a superlattice and leads 
to profound changes in the graphene’s electronic spectrum. Second- 
generation Dirac points'*” appear as pronounced peaks in resisti- 
vity, accompanied by reversal of the Hall effect. The latter indicates 
that the effective sign of the charge carriers changes within graphene’s 
conduction and valence bands. Strong magnetic fields lead to Zak- 
type cloning” of the third generation of Dirac points, which are 
observed as numerous neutrality points in fields where a unit fraction 
of the flux quantum pierces the superlattice unit cell. Graphene 
superlattices such as this one provide a way of studying the rich 
physics expected in incommensurable quantum systems’ °?** and 
illustrate the possibility of controllably modifying the electronic spec- 
tra of two-dimensional atomic crystals by varying their crystal- 
lographic alignment within van der Waals heterostuctures”’. 

Since the first observation of Weiss oscillations!”, two-dimensional 
electronic systems subjected to a periodic potential have been studied in 
great detail>*. The advent of graphene rapidly sparked interest in gra- 
phene superlattices'**. The principal novelty of such superlattices is 
the Dirac-like spectrum and the fact that charge carriers are not buried 
deep under the surface, which allows a relatively strong superlattice 
potential on the nanometre scale. One promising method of making 
nanoscale graphene superlattices is the use of a potential induced by 
another crystal. For example, graphene placed on top of graphite or 
hexagonal boron nitride (hBN) exhibits a moiré pattern’?'*”*, and the 
graphene’s tunnelling density of states becomes strongly modified’””*, 
indicating the formation of superlattice minibands. This spectral recon- 
struction occurs near the edges of the superlattice Brillouin zone (SBZ) 
that are characterized’?” by wavevector G=4n/\/3D and energy 
Es = hv,G/2 (D is the superlattice period, vp is graphene’s Fermi velo- 
city and vi is Planck’s constant divided by 27). 

To observe moiré minibands in transport properties, graphene has 
to be doped so that the Fermi energy reaches the reconstructed part of 
the spectrum. This imposes severe constraints on the misalignment 
angle, 0, of the graphene relative to the hBN substrate. Indeed, D is 
determined from @ and the 1.8% difference between the two lattice 
constants’. In the case of perfect alignment (0 = 0), D has a maximum 
value of ~13 + 1 nm (ref. 12), which yields Es ~ 0.2 eV. This energy 


scale corresponds to a carrier density of n ~ 3 X 10'*cm 2, which is 
achievable by field-effect doping. However, misorientation by only 2° 
decreases D by a factor of two’’, and fourfold greater values of n are 
necessary for the Fermi energy to reach the edges of the first SBZ. In 
practice, studies of the superlattice spectrum in monolayer graphene 
require 0 = 1° (Methods). 

Here we study high-mobility encapsulated graphene devices that are 
similar to those reported previously” but which involve a new element: 
crystallographic alignment between the graphene and the hBN with a 
precision of ~1°. Figure 1 shows typical behaviour of longitudinal and 
Hall resistivities (p,,, and p,,, respectively) for our aligned devices. 
There is the standard peak in p,, at n = 0, graphene’s main neutrality 
point. In addition, two other peaks appear symmetrically, one on either 
side of the main neutrality point, at high doping, n = +n. At low 
temperatures (T), the secondary peak on the hole side is stronger than 
that at the main neutrality point, whereas that on the electron side is 
~10 times weaker. The reversal in sign of p,, (Fig. 1b) cannot be 
explained by additional scattering and proves that hole-like and elec- 
tron-like carriers appear in the conduction and, respectively, valence 
bands of graphene. We attribute the extra neutrality points to the 
superlattice potential induced by the hBN, which results in minibands 
featuring isolated secondary Dirac points (Fig. la, inset). This inter- 
pretation agrees with theory’*” and the tunnelling features reported in 
ref. 12, including the fact that those were stronger in the valence band. 

Near the main neutrality point, the aligned devices have transport 
characteristics typical for graphene on hBN’’*. The conductivity 
o(n) = 1/p,,, varies linearly with n and can therefore be described in 
terms of constant mobility, 4. For the reported devices, we find that 
= (20-80) X 10°cm? V~'s~! for |n| > 10''cm~*. Around the sec- 
ondary neutrality points, o depends linearly on n — ng. At the hole-side 
secondary neutrality point (hSNP), at low temperature ju is practically 
the same as at the main neutrality point, whereas near the electron- 
side secondary neutrality point we find even higher values, ps ~ (30- 
100) X 10°cm? V's” '. However, at the main and secondary neutrality 
points the T dependences of both 1 and the minimum conductivities are 
very different. This is discussed in Supplementary Information, section 
1, and here we note only that the observed functions o(T) do not support 
the idea of major energy gaps being induced by the superlattice at the 
cloned secondary Dirac points’? ** (Fig. 1a, inset). Furthermore, follow- 
ing the approach described in ref. 29, we analysed the thermal broad- 
ening of the peaks in p,. (Supplementary Information, section 2). The 
analysis proves that the spectrum at the secondary neutrality points is 
linear, that is, Dirac-like, in agreement with theory’*”. 

Figure 2 shows the evolution of p,..(m) with increasing perpendicular 
magnetic field, B. Near the main Dirac point, we observe the standard*” 
quantum Hall effect (QHE) for graphene, with plateaux in p,,,and zeros 
in p,,at filling factors v = n@o/B = £2, +6, +10, ... where do is the flux 
quantum. Fan diagrams around the secondary Dirac points are dif- 
ferent (Fig. 2). The resistance peak of the hSNP first broadens with 
increasing B and then splits into two maxima. The maxima correspond 
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Figure 1 | Transport properties of Dirac fermions in moiré superlattices. 
a, Longitudinal resistivity, p,,, as a function of n. Positive and negative values ofn 
correspond to electrons and holes, respectively. The hole-side neutrality point 
shows a strong T dependence (Supplementary Information, section 1). Inset, one 
of the possible scenarios for the reconstruction of graphene’s spectrum”. The 
band structure is plotted only for the first and second SBZ (shown in brown and 
green, respectively). Secondary Dirac cones appear in both conduction and 
valence bands at the edges of the SBZ, shown by the black hexagon. Where the 
cones merge, van Hove singularities appear in the density of states (for details, 
see ref. 22). b, The Hall resistivity, p,,, changes sign at high electron and hole 
doping, revealing well-isolated secondary Dirac points. The data are for device A, 
for which ns ~ 3.0 X 10’? cm’, yielding D ~ 12 nm. We fabricated 11 aligned 
devices, six of which had essentially the same behaviour as shown here. The only 
difference was in ns, which varied between 3.0 X 10!? and 3.8 X 10!2cm 
(D~ 11+ 1nm). One other device had ng ~ 7.1 X 10’? cm 2, which implies'* 
that 0 ~ 1.2° and required gate voltages close to dielectric breakdown. Inset, 
conductive atomic force microscope image of the moiré pattern for one of our 
devices. The centre-to-centre separation between the white spots is ~11 nm. We 
note that, in measurements of Pry both positive and negative values of B were 
always used to symmetrize the data and subtract a small contribution due to p,,.. 


to the superlattice filling factors vs = +2, where the carrier density is 
counted from the hSNP. In the middle of each maximum, there is a 
deep minimum (narrow white stripes in Fig. 2a). The minima in p,,. are 
accompanied by positive and negative extrema in pxy (Fig. 2c, d). This 
shows that electron-like cyclotron trajectories in graphene’s valence 
band persist when B is quantizing (that is, when it quantizes the spec- 
trum). With decreasing T, p,.. inside the narrow minima tends to zero 
and the corresponding extrema in p,, become increasingly more pro- 
nounced, which is behaviour characteristic of the development of 
Shubnikov-de Haas oscillations into QHE states (Fig. 2c, d). The T 
dependence yields a QHE gap of ~20 meV (Supplementary Fig. 6). 
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Figure 2 | Quantization in graphene superlattices. a, Longitudinal resistivity, 
Pxx(n, B), at 20 K. Grey scale: white, 0 kQ; black, 8.5 kQ. b, Magnified view near 
the electron-side secondary neutrality point. Grey scale: white, 0 kQ; black, 

1.1 kQ. Blue numbers denote v for the QHE states extending from the main 
Dirac point. The red arrows ina mark the superlattice quantum states that evolve 
along vs = +2 (the arrows are shifted so as not to obscure the white stripes). In 
b, the red arrows indicate vg for the electron-side neutrality point. It is difficult to 
associate the electron-side superlattice Landau levels decisively with any 
particular value of vs, although the strongest peak in p,, corresponds closely to 
Vs = +2.¢ d, Detailed behaviour near the hSNP in fields marked by the dashed 
purple and green lines in a. Data are for device A but the same quantization 
behaviour was found in all devices that exhibited the secondary neutrality points. 
An exception is the white stripes at vy = +2, which were often smeared by 
inhomogeneity such that only broader maxima in p,,. remained (similar to the 
curves at 50 K). However, the narrow extrema in Pry associated with the minima 
in Px (d), were always present. h, Planck’s constant; e, electron charge. 


Unlike cyclotron gaps, this one is practically independent of B, as is 
seen also from the fact that the white stripes in Fig. 2a do not widen. 
With increasing T, the QHE states at ve = +2 gradually disappear 
below 50 K but the maxima in p,,. persist up to 150 K. 

Another notable feature of the observed fan diagrams are the mul- 
tiple peaks in p,,.accompanied by zeros or deep minima in p,,,. This is 
seen most clearly for devices where doping sufficiently higher than ng 
can be achieved (Fig. 3). Furthermore, in all our devices near the hSNP, 
Pxy repeatedly changes its sign with increasing B, indicating recurrent 
appearance and disappearance of electron-like orbits within graphene’s 
valence band (Fig. 3b, e and Supplementary Figs 4 and 5). This means 
that, for a given n, the magnetic field alone can repeatedly generate new 
neutrality points. Such ‘third-generation’ neutrality points occur peri- 
odically as n and 1/B vary, and form distinct groups characterized by 
particular values of 1/B (Fig. 3 and Supplementary Information, sec- 
tion 3). Their periodicity in 1/B is accurately described by unit fractions, 
o/q, of the magnetic flux, ® = BS, per superlattice unit cell area, Sg, 
where q is integer. In the conduction band, the fan diagrams also exhibit 
Landau levels extending from the secondary Dirac point, and numerous 
third-generation neutrality points with the same 1/B periodicity are 
visible in Figs 2b and 3a, b. These features are weaker than those in 
the valence band. For example, the resistivity peak at the electron-side 
neutrality point is no longer there for B ~ 1 T (Fig. 2b) and we did not 
observe the secondary QHE in the conduction band. 

The observed superlattice behaviour suggests that complex spectral 
changes are induced by quantizing B. Theoretically, the problem is 
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Figure 3 | Zak-type cloning of third-generation Dirac points. 

a, Longitudinal conductivity, 0.1 = Pxx/ (Dex? + Puy )s as a function of n and B 
(device B). Grey scale: white, 0 kQ; black, 2.2 mS. The dashed lines indicate 
Bg = (1/q)o/S@ with q = 3-10, where S@ is determined from the measured 
ns ~ 3.8 X 10’ cm 7 as S@ = 4/ns, which corresponds to the complete filling 
of the first SBZ”. b, Hall resistivity, p,,, for the same device as a function of n 
and 1/B (the latter expressed as #/@ on the left axis). The dashed white lines 
show the periodicity of Zak oscillations. Colour scale: navy, —2 kQ; white, 0 kQ; 
wine, 2 kQ. In both plots, T = 2 K. c, Hofstadter-like butterfly for the graphene- 
on-hBN superlattice. The electronic states are calculated following the 
approach in ref. 24 and are shown by black dots. For simple fractions p/q, we 
plot energies of the states in red. The regions around ® = #op/q are empty 


somewhat similar to that originally discussed by Zak”* and Hofstadter? 
and later considered for two-dimensional electron systems’ in semi- 
conductor superlattices and for Dirac fermions in twisted bilayers. 
The most general, but not proven, prediction is that superlattice spectra 
should be ‘self-similar’; that is, they should consist of multiple clones of 
an original spectrum, which appear at values of B such that ® = $o(p/q), 
where both p and q are integer. Our case of graphene on hBN is analysed 
in Supplementary Information, and the main theoretical results are 
summarized in Fig. 3c, d. 

Figure 3c shows that the superlattice potential results in additional 
structure within each Landau level, which effectively broadens them 
with increasing B. The structured Landau levels extending from the main 
and secondary Dirac points strongly mix at high doping, |E| = Es = 
hv,(nns)?. The resulting pattern is different from that in semiconductor 
superlattices with a parabolic spectrum and weak modulation*”*. In the 
latter case, the fractal structure within each Landau level can be described 
by the original Hofstadter butterfly’, which appears periodically as a 
function of ¢/®. In our case, in which there is a Dirac-like spectrum 
and strong modulation, the fractal pattern depends on the Landau level 
index, N, and B (Supplementary Information, section 5). 

The calculated spectrum allows us to understand many features 
observed experimentally. Indeed, Fig. 3c shows a self-similarity such 
that magnetic states tend to entwine at © = dop/q, forming the fractal 
structure of the pattern. The strongest entwining occurs for unit fractions 
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because the corresponding supercells are too large for us to do the necessary 
calculations****. The blue curves show several low Landau levels for small B, 
which were calculated analytically for main and secondary Dirac fermions with 
parameters of the zero-B spectrum. The green dots indicate the position of the 
Fermi level for n = —ng (Supplementary Information). d, Section of ¢ with 
superimposed Landau levels calculated as functions of 6B (magenta). e, Hall 
resistivity, p,,(n, 1/B), for device C (ns ~ 3.6 X 10'*cm”) measured up to 

29 T. The left and right axes are as in b. The same oscillatory behaviour is found 
for all our devices and is seen most clearly near the hSNP, where new neutrality 
points appear periodically for ¢o/® = q, as shown by the arrows. Colour scale: 
navy, —3 kQ; white, 0 kQ; wine, 3 kQ. T= 20K. Full plots of Pry(Ms B) for 
devices B and C are provided in Supplementary Information, section 3. 


(thatis, p = 1), and this results in an overall periodicity in 1/B with a period 
of Sg/o, in agreement with the experiment (Fig. 3a, b). The periodicity 
can be traced to the fact that for ® = #o/q the system can be considered a 
new superlattice that has a unit cell q times larger than the original and that 
is placed in zero effective magnetic field”. An example of the resulting 
magneto-electronic (Zak) bands” is given in Supplementary Fig. 8. 

We find that in our case Zak bands feature slightly gapped Dirac 
spectra. This finding is illustrated in Fig. 3d, which, as an example, 
magnifies a part of Fig. 3c near the hSNP and ® = ¢/2. Using the Zak 
spectrum calculated for ® = ¢/2, we can obtain its Landau quantiza- 
tion in small reduced fields, 5B = B — Bz, where Bz = $9/2S@ is the 
zero effective field acting on the cloned Dirac spectrum. The shape 
of the resulting Landau levels is given by + \/N|8B|+ A’, where 4 is 
the gap in the local Dirac-like spectrum (Fig. 3d, magenta curves). 
Outside the empty horizontal region inaccessible in our numerical 
calculations (Supplementary Information, section5), the Landau 
quantization of the Zak spectrum in 6B yields practically the same 
electronic states as shown by the calculated points (Fig. 3d, black dots). 
Similar local Dirac spectra are found in other parts of the moiré 
butterfly for all B,=o/qS@, in agreement with the numerous 
third-generation neutrality points seen in the experiment. Using the 
calculated spectra, we have also determined the occupancy of Zak 
minibands for the case of four holes per moiré supercell (that is, at 
the hSNP) and found that for ® = #/q the Fermi energy lies inside the 
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corresponding Zak minibands, whereas for ® = ¢o/(q + 1/2) it lies 
inside gaps. This explains the experimentally observed oscillations in 
Oxx(B). 

Another experimental feature revealing Zak minibands and the 
hierarchy of superlattice gaps are the prominent QHE gaps at 
Vg = +2 near the hSNP (Fig. 2). For small B, they can be considered 
a result of Landau quantization for secondary Dirac fermions, and 
their zero Landau levels become separate from the rest of the spectrum 
(Fig. 3c). For higher B, the resulting gaps saturate, being limited in size 
by the presence of van Hove singularities at the edges of the SBZ 
(Fig. la). As Zak minibands become increasingly more pronounced 
the secondary Landau levels intertwine with main Landau levels and, 
at high doping, become indistinguishable from them (Fig. 3c). 
Therefore, the complex pattern of Landau levels in Fig. 3 at high 
doping can no longer be interpreted in terms of Landau quantization 
of either main or secondary Dirac fermions. The pattern becomes a 
‘Hofstadter-Landau’ butterfly, specific to our strong-modulation 
regime and the linear spectrum. The largest fractal gaps near the 
hole-side Dirac point in Fig. 3c are in agreement with the vs = +2 
QHE states observed experimentally, which have activation energies 
almost independent of B. This behaviour is different from the case of 
weak modulation in semiconductor superlattices®*, where Landau 
levels become structured but do not intertwine. However, increasing 
B such that © > go can drive graphene superlattices into the regime of 
weak modulation (Supplementary Fig. 9). This regime is outside the 
range of B values available in our experiment. In addition to the large 
fractal gaps, our experimental data also reveal reproducible small-scale 
structure that cannot be traced back to either main or secondary neut- 
rality points (see, for example, Fig. 3a, b near the hSNP at high B and 
Supplementary Fig. 5a). We attribute these fine features to further 
fractalization of the superlattice spectrum such that isolated Landau 
levels for the third-generation Dirac clones start being resolved 
(Fig. 3d). This is similar to the intra-Landau-level features reported 
in semiconductor superlattices’*® and warrants further investigation. 

Graphene superlattices can be reliably fabricated for various types of 
transport measurement. This opens new lines of enquiry; in particular, 
the fractal quantization leads to such rich behaviour that its full 
understanding will require much further work, both theoretical and 
experimental. The demonstrated possibility of creating gaps at specif- 
ically chosen energies by controllably rotating graphene or other two- 
dimensional crystals within van der Waals heterostructures” can be 
used to design novel electronic and optoelectronic devices. 


METHODS SUMMARY 


Our devices were multiterminal Hall bars fabricated following the procedure 
described in ref. 27. In brief, monolayer graphene was deposited on top of a rela- 
tively thick (>30 nm) hBN crystal”* and then covered with another hBN crystal. 
The encapsulation protects the graphene from the environment and allows high 1 
values, little residual doping (<10"' cm”) and little charge inhomogeneity”. The 
interfaces between the graphene and the hBN were atomically clean over the entire 
active device area”’. The whole stack was assembled on top of an oxidized Si wafer 
that served as a back gate. To align the crystal lattices, we used an optical microscope 
to choose straight edges of graphene and hBN crystallites, which indicate the 
principal crystallographic directions (see, for example, figure 2 of ref. 30). During 
the assembly, the graphene was rotated relatively to the bottom hBN crystal to make 
their edges parallel. We estimate our alignment precision to be ~1°. The top hBN 
crystal was then rotated by ~15° with respect to the aligned edges, which ensured 
no spectral reconstruction at Es; < 1 eV due to the second graphene-hBN interface. 

In practice, gate dielectric breakdown for oxidized silicon wafers occurs in 
fields less than 0.4Vnm_!, and this limits achievable values of 1 to less than 
7X10 cm ? (Es <0.35 eV). Accordingly, the observation of secondary Dirac 
points requires alignment with 0 = 1° (ref. 12). For random deposition of graphene 
on hBN, the probability of finding transport devices with superlattice features can 
be estimated to be only a few per cent, even if high gate voltages are used, which is 
rare given the desire to avoid accidental breakdown. Previously, we investigated 
more than 25 graphene-on-hBN devices” and none of them exhibited any sign of 
superlattice effects. This shows that careful alignment is essential for the obser- 
vation of secondary Dirac spectra in transport measurements. 
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Hofstadter’s butterfly and the fractal quantum Hall 
effect in moire superlattices 


C. R. Dean!, L. Wang’, P. Maher’, C. Forsythe®, F. Ghahari®, Y. Gao”, J. Katoch’, M. Ishigami*, P. Moon®, M. Koshino®, 


T. Taniguchi®, K. Watanabe®, K. L. Shepard’, J. Hone? & P. Kim? 


Electrons moving through a spatially periodic lattice potential 
develop a quantized energy spectrum consisting of discrete Bloch 
bands. In two dimensions, electrons moving through a magnetic 
field also develop a quantized energy spectrum, consisting of highly 
degenerate Landau energy levels. When subject to both a magnetic 
field and a periodic electrostatic potential, two-dimensional systems 
of electrons exhibit a self-similar recursive energy spectrum’. Known 
as Hofstadter’s butterfly, this complex spectrum results from an 
interplay between the characteristic lengths associated with the two 
quantizing fields’, and is one of the first quantum fractals discov- 
ered in physics. In the decades since its prediction, experimental 
attempts to study this effect have been limited by difficulties in recon- 
ciling the two length scales. Typical atomic lattices (with periodicities 
of less than one nanometre) require unfeasibly large magnetic fields 
to reach the commensurability condition, and in artificially engi- 
neered structures (with periodicities greater than about 100 nano- 
metres) the corresponding fields are too small to overcome disorder 
completely'’”. Here we demonstrate that moiré superlattices arising 
in bilayer graphene coupled to hexagonal boron nitride provide a 
periodic modulation with ideal length scales of the order of ten nano- 
metres, enabling unprecedented experimental access to the fractal 
spectrum. We confirm that quantum Hall features associated with 
the fractal gaps are described by two integer topological quantum 
numbers, and report evidence of their recursive structure. Observa- 
tion of a Hofstadter spectrum in bilayer graphene means that it is 
possible to investigate emergent behaviour within a fractal energy 
landscape in a system with tunable internal degrees of freedom. 
The total number of electron states per area of a completely filled 
Bloch band is m9 = 1/A, where A is the area of the unit cell of the periodic 
potential. In a magnetic field, B, the number of states per area of each 
filled Landau level is given by B/o, where #9 = h/e is the magnetic flux 
quantum (h, Planck’s constant; e, magnitude of the electron charge). 
The quantum description of electrons subjected simultaneously to both 
a periodic electric field and a magnetic field can be simply parameter- 
ized by the dimensionless ratio #/#, where ¢ = BA is the magnetic flux 
per unit cell. The general solution, however, exhibits a rich complexity 
due to the incommensurate periodicities of the Bloch and Landau 
states'*, For commensurate fields, corresponding to rational values of 
$/#o = p/q, where p and q are co-prime integers, the single-particle 
Bloch band splits into q subbands’ (beginning with the Landau level 
description, it can be shown that, equivalently, the energy diagram is 
parameterized by ¢,/ = q/p such that at these same rational values 
each Landau level splits into p subbands’). This results in a quasi- 
continuous distribution of incommensurate quantum states with a 
self-similar recursive structure, yielding a butterfly-like fractal energy 
diagram called the Hofstadter buttefly' (Supplementary Information). 
Important insight into this system came from consideration of the 
density of charge carriers, n, required to fill each fractal subband?. 


Replotting the Hofstadter energy spectrum as integrated density versus 
field shows that all spectral gaps are constrained to linear trajectories in 
the density—field diagram (Wannier diagram). This can be described 
by a simple Diophantine relation 


(n/n) =t($/$o) +s (1) 


where n/n, and $/¢, are the normalized carrier density and magnetic flux, 
respectively, and s and f are both integer valued. Here n/n, represents the 
Bloch band filling fraction, which is distinct from the usual Landau level 
filling fraction, v = ndo/B (the two are related by the normalized flux, that 
is, n/No = vol). The physical significance of the quantum numbers s 
and t became fully apparent with the discovery of the integer quantum 
Hall effect’? (QHE), after which it was shown that the Hall conductivity 
associated with each minigap in the fractal spectrum is quantized accord- 
ing to the relation Oxy = te*/h (refs 3, 4). The second quantum number, s, 
physically corresponds to the Bloch band filling index in the fractal 
spectrum®. This formalism suggests several unique and unambiguous 
experimental signatures associated with the Hofstadter energy spectrum 
that are distinct from the conventional QHE. First, the Hall conductance 
can vary non-monotonically and can even fluctuate in sign. Second, the 
Hall conductance plateaux, together with vanishing longitudinal resist- 
ance, can appear at non-integer Landau level filling fractions. Third, the 
Hall conductance plateaux remain quantized in integral multiples of e”/h. 
However, the quantization integer is not directly associated with the usual 
Landau level filling fraction. Instead, quantization is equal to the slope of 
the gap trajectory in the n/n.-¢/d, Wannier diagram, in accordance with 
the Diophantine equation (equation (1)). 

Minigaps within the fractal energy spectrum become significant only 
once the magnetic length (/g = \///eB; h, Planck’s constant divided by 
2m), which characterizes the cyclotron motion, is of the same order as 
the wavelength of the periodic potential, which characterizes the Bloch 
waves. For usual crystal lattices, where the interatomic spacing is a few 
angstroms, the necessary magnetic field is unfeasibly large, in excess of 
10,000 T. The main experimental effort therefore has been to litho- 
graphically define artificial superlattices'’'” with unit-cell dimensions 
of order tens of nanometres, so that the critical magnetic field remains 
small enough to be achievable in the lab yet still large enough that 
the QHE is fully resolved without being smeared out by disorder. 
Fabricating the optimally sized periodic lattice while maintaining 
coherent registry over the full device and without introducing substan- 
tial disorder has proven to be a formidable technical challenge. Pat- 
terned GaAs/AlGaAs heterostructures with ~100-nm-period gates 
provided the first experimental support for the predictions of a 
Hofstadter spectrum'*’*. However, limited ability to tune the carrier 
density or reach the fully developed QHE regime in these samples has 
made it difficult to map out the complete spectrum. Although similar 
concepts have been pursued in non-solid-state model systems*°”’, the 
rich physics of the Hofstadter spectrum remains largely unexplored. 
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Heterostructures consisting of atomically thin materials in a multi- 
layer stack provide a new means of realizing a two-dimensional system 
with a laterally modulated periodic structure. In particular, coupling 
between graphene and hexagonal boron nitride (hBN), whose crystal 
lattices are isomorphic, results in a periodic moiré pattern. The moiré 
wavelength is directly related to the angular rotation between the two 
lattices’”4, and can be tuned through the desired length scales without 
the need for lithographic techniques*”. Moreover, hBN provides an 
ideal substrate for achieving high-mobility graphene devices, which is 
crucial for high-resolution quantum Hall measurements”, and field- 
effect gating in graphene allows the Fermi energy to be continuously 
varied through the entire moiré Bloch band. 

In this study, we used Bernal-stacked bilayer graphene (BLG) Hall 
bars fabricated on hBN substrates (Fig. 1a, b) using mechanical exfo- 
liation followed by co-lamination (Methods Summary). Figure 1b 
shows a non-contact atomic force microscopy (AFM) image acquired 
from an example device. In the magnified region, a triangular moiré 
pattern is visible with wavelength 15.5 + 0.9 nm. This is comparable to 
the maximal moiré wavelength of ~14nm expected for graphene on 
hBN*”™4, suggesting that in this device the BLG lattice is oriented 
relative to the underlying hBN lattice with near-zero angle mismatch. 

Figure 1c shows transport data measured from the same device. In 
addition to the usual resistance peak at the charge neutrality point (CNP), 
occurring at gate voltage V,~ 2, two additional satellite resistance 
peaks appear, symmetrically located at V,.~ +30 V relative to the 
CNP. These satellite features are consistent with a depression in the 
density of states at the superlattice Brillouin zone band edge, analogous 
to previous spectroscopic measurements of single-layer graphene 
coupled to a moiré superlattice*’’. Assuming non-overlapping bands, 
[Vea] gives an estimate of the moiré wavelength of ~14.6nm 
(Supplementary Information), in good agreement with the AFM mea- 
surements. The nature of these satellite peaks can be further probed in the 
semiclassical, low-B transport regime. In Fig. 1d, longitudinal resistance, 
Ry» and transverse Hall resistance, R,,, are plotted versus gate voltage at 


Graphene 
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Figure 1 | Moiré superlattice. a, Sketch of graphene on hBN showing the 
emergence of a moiré pattern. The moiré wavelength varies with the mismatch 
angle, 0. b, Left: an AFM image of a multiterminal Hall bar. Right: a high- 
resolution image of a magnified region. The moiré pattern is evident as a 
triangular lattice (upper inset shows a further magnified region). A fast Fourier 
transform of the scan area (lower inset) confirms a triangular lattice symmetry 
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B=1T. Near the central CNP, the Hall resistance changes sign as the 
Fermi energy passes from the electron to the hole band. The same trend 
also appears near Va, consistent with the Fermi energy passing through 
a second band edge. This provides further confirmation that the moiré 
pattern, acting as a periodic potential superlattice, gives rise to a mini- 
Brillouin zone band’. We observed the satellite peak to be more 
developed in the hole branch than in the electron branch in all samples, 
in agreement with previous experimental and theoretical studies of hBN- 
supported monolayer graphene*”””*. The satellite peaks vanish at tem- 
peratures above 100K (Fig. Ic, inset), indicating that the coupling 
between the BLG and hBN atomic lattices is of order ~10 meV. 
Perfect crystallographic alignment between graphene and hBN is 
expected to open a ~50-meV bandgap”’”®, leading to a low-temperature 
divergence in the resistance at the CNP. The weak temperature depend- 
ence observed in our device suggests the absence of a gap, possibly owing 
to the lattice mismatch between the BLG and hBN. 

In the remainder of this Letter, we focus on magnetotransport mea- 
sured at high field. Figure 2a shows the evolution of R,. and R,, for 
magnetic fields up to 31 T. In the left panel (a Landau fan diagram), R,. 
is plotted against the experimentally tunable gate voltage and magnetic 
field. In the right panel, the magnitude of the corresponding R,, is 
plotted against the dimensionless parameters appearing in the 
Diophantine equation, n/n, and $/,. This Wannier diagram is simply 
the Landau fan diagram with both axes relabelled by dimensionless 
units defined by normalizing to the moiré unit-cell area. 

Ina conventional quantum Hall system, the Landau fan diagram exhi- 
bits straight lines, tracking minima in R,,. and plateaux in R,,. Plotted 
against n/n, and #/d,, the slope of each line is precisely the Landau level 
filling fraction, v, and all lines converge to the origin. White lines in Fig. 2a 
identify QHE states matching this description, tracking Landau level 
filling fractions v = 4, 8 and 12. This is consistent with the usual QHE 
hierarchy associated with a conventional degenerate BLG spectrum. 

At large magnetic fields, several additional QHE states, exhibiting 
minima in R,.,, together with plateaux in R,,, develop outside the usual 


V, (V) 


with periodicity 15.5 + 0.9 nm. c, Measured resistance versus gate voltage at 
zero magnetic field. Inset: the corresponding conductivity versus temperature, 
indicating that the satellite features disappear at temperatures greater than 
~100K. d, Longitudinal resistance (left axis) and Hall resistance (right axis) 
versus gate voltage at B= 1 T. The Hall resistance changes sign and passes 
through zero at the same gate voltage as the satellite peaks. 
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Figure 2 | Emergence of anomalous quantum Hall states. a, Landau fan 
diagram showing longitudinal resistance, R,.. (left), and Hall resistance, Ryy 
(right). R, is plotted versus magnetic field on the vertical axis and versus gate 
bias on the horizontal axis. In the diagram showing R,,, the axes are scaled by 
the size of the moiré unit cell to give ¢/d, on the vertical axis and n/n, on the 
horizontal axis. QHE states corresponding to the conventional BLG spectrum 
are indicated by white lines. Solid yellow and red lines track the QHE outside 
this conventional spectrum, with dashed lines indicating the projected n/n, 
intercept. The slope of each line is shown on the top axis. b, Longitudinal and 


BLG sequence and also follow straight lines in the Landau fan diagram, 
but converge to non-zero values of n/n. Yellow and red lines in Fig. 2a 
trace examples of these anomalous QHE states appearing within the 
lowest Landau level. Unlike the conventional QHE states, each of the 
anomalous QHE states is characterized by both an integer-valued 
intercept, s (yellow and red lines converge to n/n, = 1 and 2, respec- 
tively) and an integer-valued slope, t (labelled along the top axis in the 
figure). In Fig. 2b, longitudinal and Hall conductivities measured at 
constant magnetic field (corresponding to horizontal line cuts through 
the fan diagram in Fig. 2a) are plotted against Landau level filling 
fraction, v. At large magnetic fields, the anomalous QHE states are 
remarkably well developed, exhibiting wide plateaux in o,,, concom- 
itant with o,, = 0. Moreover, these states appear in general at non- 
integer filling fractions. Comparison between Fig. 2a and Fig. 2b further 
reveals that Hall conductivity plateaux are quantized in integer multi- 
ples of e”/h, where the quantization integer t equals the slope in the 
Wannier diagram. Similar internal structure is observed within higher- 
order Landau levels (Fig. 3 and Supplementary Information). The ano- 
malous QHE states observed here are consistent with fully developed 
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transverse Hall conductivities corresponding to line cuts at constant magnetic 
field (constant #/#,) from the Landau fan diagram in a. At B= 7 T, the QHE 
ladder is consistent with previous reports for bilayer graphene. At B = 18 and 
26 T, additional QHE states emerge, showing Hall conductivity plateaux 
quantized in integer multiples of e”/h, but appearing at non-integer Landau 
level filling fractions. Yellow and red bars indicate correspondence to the 
similarly coloured anomalous features marked by solid lines in a. Blue bars 
indicate the conventional QHE features. Numbers label the quantization 
integer for each plateau. 


spectral gaps resulting from a Hofstadter-type energy spectrum. 
Moreover, our ability to map fully the density-field space provides a 
remarkable confirmation of the Diophantine equation: we observe 
direct evidence that QHE features associated with the Hofstadter spec- 
tral gaps are characterized by the two quantum numbers, s and f, cor- 
responding to the n/n, intercept and the slope, respectively, in the 
Wannier diagram. 

Figure 3 shows similar data to Fig. 2, but measured from a separate 
device in which the moiré wavelength is only 11.6nm. Again, QHE 
states appear outside the conventional Bernal-stacked BLG sequence 
and follow straight lines whose origin and slope are both integer 
valued, with the slope exactly matching the Hall quantization, in pre- 
cise agreement with the Diophantine equation. Similar to the previous 
device, the v= 0 insulating state undergoes a drastic change near 
$/¢. = 1/2, when anomalous QHE states associated with the fractal 
gaps begin to develop fully. 

In Fig. 3b, the lower panel shows o,,, measured at B = 25, corres- 
ponding to a horizontal line cut through Fig. 3a (dashed white line), for 
a variety of sample temperatures. The magnitudes of the fractal gaps 
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Figure 3 | Fractal gaps. a, Landau fan diagrams similar to those in Fig. 2 but 
measured from a separate device. Here the zero-field satellite peak position 
indicates a moiré period of 11.6 nm, indicating that the superlattice unit cell was 
approximately 1.5 times smaller in this device than in the one used in Fig. 2. 
Significantly more structure is observed here than in Fig. 2. b, Bottom: the 
evolution of o,. with temperature varying between 2 and 20K, acquired at 
constant B = 25 T, which corresponds to the line cut shown in a. Top: the 
corresponding 6,, at T = 2 K. The bracketed numbers label the (s, f) values of 
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the corresponding fractal gaps according to the Diophantine equation. 

c, Bubble plot of energy gaps determined from the temperature dependence 
calculated at two magnetic fields (B = 25 and 28.5 T). The gaps are plotted as 
circles with radius scaled relative to the largest gap value measured. Dashed 
lines trace select fractal gap positions allowed by the Diophantine equation. 
Solid lines trace regions where the corresponding fractal gaps appear as minima 
in ¢,, together with quantized plateaux in o,,. The colours indicate gaps with 
the same quantum number s. 
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were estimated from the temperature dependence of the o,,. minima in 
the thermally activated regime (Supplementary Information), at two 
separate magnetic field values, 25 and 28.5 T. Figure 3c summarizes 
our findings. Each fractal gap is marked by a circle centred at the 
corresponding (n/n,, ¢/p,) coordinates, and with radius scaled relative 
to the largest gap value (Supplementary Information). As the magnetic 
field increases, the spectral energies develop in a complicated way: 
some gaps grow with field (for example those with (s, t) = (1, 1) and 
(2, —2)), whereas others shrink (for example (1, —4)). For a fixed 
magnetic field, it seems generally true that for constant s values, fractal 
gap states exhibit increasing gap size as t increases. For example, at 
B=25T, 4a,—4) ~ 48 K whereas 4(1,—3) ~ 30 K. This contradicts the 
prediction that fractal gaps corresponding to lower quantum numbers 
have larger gap values'. We note that such a trend was subsequently 
found to be specific to square lattice symmetries**. Furthermore, a 
non-trivial case also arises when two fractal gap states overlap’, 
such as occurs between the (1, —2) and (2, —3) states in our data as 
’/.— 1. Further theoretical analysis specific to moiré-patterned BLG 
is necessary to understand fully the trends highlighted here. 

Figure 4a shows a normalized Landau fan diagram of R,, values 
corresponding to the R,, data in Fig. 3a. Dashed horizontal lines in the 
figure label special values of the normalized magnetic flux, $/o = 1\/m, 
where m is integer valued. Referred to as the ‘pure cases”, these lines of 
high symmetry provide the framework for the recursive structure of 
the butterfly spectrum, marking the boundaries of the repeating sub- 
cells that appear within the main cell°. In Fig. 4a, at the pure cases in the 
fan diagram, R,., seems to tend to zero and change sign. This is also 
seen from the single line trace in Fig. 4b. In the quantum Hall regime, 
the longitudinal conductivity has a local peak as the magnetic field 
passes through the pure cases, with the corresponding Hall conductivity 
exhibiting a sharp transition. For large magnetic fields, both of these 


LETTER 


features span the full Landau level along lines of constant $/q,, as seen 
in Figs 3a and 4a, respectively. Near the field corresponding to $/¢, = 
1/2, labelled in Fig. 4c as B,/2, plateaux appear in R,, together with 
minima in R,,, resembling a mini QHE series centred on B,/. If we 
redefine the local effective magnetic field as B’ = B — B,,/», then, accord- 
ing to the usual QHE formalism, we expect to find that v’ = (1/B’)n'h/e, 
where v’ is an effective filling fraction given by the Hall quantization, B’ 
is the value of the effective magnetic field at the R,,. minima and n’ isan 
effective carrier density. The inset in Fig. 4c shows a plot of v’ versus 
1/B’ and the data indeed follows a linear trend. In spite of the large 
magnetic field (B,/2 ~ 17.3 T), this indicates that locally the electrons 
behave as if the magnetic field is reduced to zero. We regard this as 
compelling evidence of the long-predicted recursive nature of the 
Hofstadter spectrum, where repeated mini fan diagrams emerge within 
the main one. We note that the linear trend shown inset in Fig. 4c does 
not pass through the origin, but is vertically offset by 4.1 + 0.1. The 
origin of this offset is unclear but may be related to disorder effects 
because in this regime the spectrum is not fully gapped’. 

Finally, we note that precise modelling of the fractal spectrum (Sup- 
plementary Information) requires a quantitative understanding of the 
atomic-scale couplings between BLG and hBN and between the BLG 
layers, which are not well known. Additionally the odd-integer quantum 
numbers observed in our experiment, which may not be accounted for in 
a purely single-particle picture, invite further investigation of correlated 
electron behaviour within the Hofstadter spectrum. 

In conclusion, we report experimental confirmation that the general- 
ized behaviour of electrons subjected simultaneously to both a magnetic 
field and a spatially varying periodic electrostatic field is described by a 
Diophantine equation involving two topological quantum numbers. 
The ability to tune the competing length scales governing these 
Landau-Bloch-Dirac states in graphene coupled to a substrate-induced 
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Figure 4 | Recursive structure. a, R,,, Wannier diagram for the device used in 
Fig. 3. White solid lines label ¢/¢, values corresponding to the pure cases, 
$/, = 1/m. Inset: data replotted against ¢,/¢, illustrating that the main 
experimental features exhibit a 1/B periodicity. b, Longitudinal conductivity, 
Oxx (left axis), and Hall conductivity, o,,, (right axis), versus magnetic field, 
measured at a constant gate voltage (V, = —39 V, corresponding to the white 
dashed line in a). Blue, green and purple bands mark the boundaries of the 
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conventional N = 2, 3 and 4 Landau levels, respectively. Dashed lines mark the 
values of B corresponding to the pure fractions labelled in a. The values of $/#, 
are shown on the top axis. c, Details of both o,. and ¢,, in the vicinity of 

$/b 5 = 1/2 (b). Plateaux in Oxy concomitant with minima in ¢, resemble a mini 
QHE trace centred on $/#, = 1/2, consistent with the prediction of recursion in 
the butterfly spectrum’. Inset: the positions of the mini QHE plateaux plotted 
versus effective magnetic field follow a linear trend (see text). 
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moiré superlattice makes it possible to study experimentally the com- 
plexity of the Hofstadter energy spectrum and, in particular, to invest- 
igate the role of electron interactions within it. 


METHODS SUMMARY 


Using a co-lamination mechanical transfer technique similar to that described 
previously*>”*, graphene-hBN stacks were fabricated on doped Si substrates with a 
~300-nm oxide layer. Each stack was then etched into a Hall bar with evaporated 
Cr/Pd/Au leads using standard electron-beam lithography processes. The bottom 
boron nitride layer was chosen to be between 10 and 20 nm thick. More than 20 
devices were made in this way, with six devices showing similar behaviour to that 
reported here. We focus only on two high-quality devices in the text, listing other 
examples in Supplementary Information. The device shown in Fig. 1b represents a 
typical Hall bar geometry with a channel width of 1 um and spacing between 
voltage probes varying from 1 to ~4 jm. Identical transport features were mea- 
sured across all voltage probes, indicating that the moiré pattern exhibits good 
uniformity over the entire length of the device. Mobility in our devices ranged 
from approximately 10,000 to 100,000 cm?V~!s~!, as estimated by fitting a 
Boltzmann model to the linear response. Four-terminal transport measurements 
were performed using a lock-in amplifier at 17 Hz with a 10-100-nA source 
current, using the doped silicon substrate to gate the channel. Samples were mea- 
sured in a 31-T resistive magnet and a “He cryostat (sample in vapour). 
Longitudinal and Hall conductivities were calculated from the measured resistances 
according to xx = Py jf (P + R,) and oxy =Ry ( pat R,) , respectively. 

AEM images of the device were acquired at room temperature, using an 
Omicron low-temperature AFM system. Imaging was performed using a bias 
voltage of Vpias = 0.2 V and frequency shift of 6f= 20 Hz. Images were filtered 
to remove noise. 
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Persistent export of ~’Pa from the deep central Arctic 
Ocean over the past 35,000 years 


Sharon S. Hoffmann’, Jerry F. McManus', William B. Curry”? & L. Susan Brown-Leger* 


The Arctic Ocean has an important role in Earth’s climate, both 
through surface processes’ such as sea-ice formation and transport, 
and through the production and export of waters at depth that 
contribute to the global thermohaline circulation”*. Deciphering 
the deep Arctic Ocean’s palaeo-oceanographic history is a crucial 
part of understanding its role in climatic change. Here we show 
that sedimentary ratios of the radionuclides thorium-230 (7?°Th) 
and protactinium-231 (7*'Pa), which are produced in sea water 
and removed by particle scavenging on timescales of decades to 
centuries, respectively*, record consistent evidence for the export 
of 7*'Pa from the deep Arctic and may indicate continuous deep- 
water exchange between the Arctic and Atlantic oceans throughout 
the past 35,000 years. Seven well-dated box-core records provide a 
comprehensive overview of ~*'Pa and 7*°Th burial in Arctic sediments 
during glacial, deglacial and interglacial conditions. Sedimentary 
?31pa/?>°Th ratios decrease nearly linearly with increasing water depth 
above the core sites, indicating efficient particle scavenging in the 
upper water column and greater influence of removal by lateral trans- 
port at depth. Although the measured **°Th burial is in balance with 
its production in Arctic sea water, integrated depth profiles for all time 
intervals reveal a deficit in **'Pa burial that can be balanced only by 
lateral export in the water column. Because no enhanced sink for 7*'Pa 
has yet been found in the Arctic, our records suggest that deep-water 
exchange through the Fram strait may export 7*Pa. Such export may 
have continued for the past 35,000 years, suggesting a century-scale 
replacement time for deep waters in the Arctic Ocean since the most 
recent glaciation and a persistent contribution of Arctic waters to the 
global ocean circulation. 

Waters in the Arctic are freshened by precipitation and run-off, 
leading to a net shallow freshwater transport into the Nordic seas’. 
Sea-ice production and melting freshen surface waters through salt 
rejection, and in some marginal settings also produce cold, saline 
waters that descend into the Arctic basins**. These dense waters travel 
in topographically steered deep currents and eventually transit the 
Fram strait, the single deep Arctic passage, into the Nordic seas, where 
they contribute to the production of North Atlantic Deep Water, a 
major component of the global thermohaline circulation’*. Beneath 
the cold, fresh surface, warm, salty, Atlantic-sourced waters fill the 
Arctic depths above 1 km; these waters may have deepened to domi- 
nate Arctic intermediate depths during past colder periods as a result 
of changes in stratification®.Such past changes in water masses may 
imply changing rates of ventilation and circulation. 

Although variations in ratios of **'Pa and *°°Th in sediment can 
provide information about past rates of oceanographic processes 
including deep circulation®*, these tracers have seldom been applied 
in Arctic sediments. Measurements of **'Pa and *°’Th in the modern- 
day Arctic water column’ suggest variable particle scavenging rates 
of **'Pa and **’Th in different parts of the basin, including relatively 
high scavenging rates within the Eurasian basin and in parts of the 
Canadian basin. The burial rate of **°Th in central Arctic sediments 
roughly balances the water column production rate, supporting the use 


of *?'Pa,,/°°Th,, (where the subscripts refer to ‘excess’ unsupported 
isotope activity age-corrected for radioactive decay since deposition; 
see Supplementary Information) as a tracer of past oceanographic 
processes in the Arctic'*. Measured 31D a.g/7>°Thy« ratios in core-top 
sediments within all four Arctic sub-basins and along the Alaska- 
Chukchi margin" fall below the production ratio of 0.093, suggesting 
a net export of ?31Da. Two records of 7*'Pa,, from the Arctic provide 
the first down-core evidence'™”” for past 7*'Pa burial, although neither 
is sufficiently resolved to give detailed information on patterns of *?'Pa 
transport and deposition. 

Seven box cores (Fig. 1 and Supplementary Table 1) with well- 
defined radiocarbon chronologies’*”’ (Supplementary Table 2) from 
the 1994 Arctic Ocean Section cruise provide a cross-Arctic transect at 
depths within intermediate and deep waters (depths of 1-3.5km), 
which we use to examine regional and depth-related trends in **'Pa 
deposition and export. Deep-water residence times in these basins can 
vary from ~150 to 400 yr (refs 20, 21), similar to or longer than the 
residence times of dissolved **'Pa in these deep waters before scav- 
enging by particles'?”. 


Figure 1 | Locations of seven box cores in a transect across the Arctic Ocean. 
PL-94-AR box cores 08, 16, 17 and 20, Makarov basin, 1-3.1-km water depth. 
PL-94-AR box core 26, Lomonosov ridge, 1 km; box core 28, Amundsen basin, 
2 km ; box core 32, Nansen basin, 3.5 km. Arctic bathymetric features: CB, 
Canada basin; AMR, Alpha—Mendeleev ridge; MB, Makarov basin; LR, 
Lomonosov ridge; AB, Amundsen basin; GR, Gakkel ridge; NB, Nansen basin; 
FS, Fram strait. For core coordinates and water depths, see Supplementary 
Table 1. Orange arrows show patterns of intermediate and deep circulation’. 
The base map used is the International Bathymetric Chart of the Arctic Ocean. 
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Ratios of 7*'Pa,, to 7°°Th,. (Supplementary Table 3) in our late- 
glacial core sections (here defined as dating from 35 to 15 kyr ago, a 
period that ended with the start of North Atlantic Bolling-Allerad 
warming) are almost uniformly below the **'Pa/**°Th seawater pro- 
duction ratio of 0.093 (Fig. 2), indicating export of *°'Pa from all 
studied sites. The lowest ratios are seen in the deepest parts of box 
cores 16 and 17, which are more than 35 kyr old; these samples have 
large uncertainties in their modelled ages, and the low ratios in these 
oldest sediments may reflect under-correction for radioactive decay 
since deposition. There is an increase in 231 Da.,/7°°Th,, at the degla- 
ciation (here defined as 15-10 kyr ago) in five cores. In the deepest, box 
core 32, deglacial ratios remain low; in box core 26, at 1-km depth atop 
the Lomonosov ridge, ratios increase to the value of the seawater 
production ratio before 16.5 kyr ago. The 31a, /7°°Th,, ratios reach 
the production ratio, indicating no local export of ?31pa in the degla- 
cial intervals in the two other shallowest cores (box cores 08 and 16, at 
depths of 1-1.6 km) and remain at the production ratio through the 
Holocene epoch in box cores 08, 16 and 26. 

Holocene ratios in intermediate-depth cores vary between near- 
production-ratio levels similar to those in shallower cores and low 
ratios similar to those in deeper cores. Box core17, at a depth of 
2.2 km in the Makarov basin, and box core 28, at a depth of 2 km in 
the Amundsen basin, show synchronous millennial-scale variability 
in the deglacial and the early Holocene, with the highest peaks in 
'Pa,./°°Thys occurring ~11 kyr ago, followed by peaks ~7.5 kyr 
ago (Fig. 2). Surface-driven records such as vertical particle flux 
(Supplementary Fig. 1 and Supplementary Table 3) do not follow these 
?31pa, /7°°Th,, variations, and differ between basins, suggesting that 
31a, /7>°Th,, is more likely to be controlled by processes at depth 
such as ventilation and deep advection. The Arctic boundary current 
splits at the Lomonosov ridge, sending flows into the Amundsen basin 
and along the slope into the Makarov basin’; our sites at similar depths 
beneath this current in each basin may reflect its influence. 

Average ?31pa./7°°Th,, ratios for the late glacial, deglacial and 
Holocene show a strong linear relationship with water depth (Fig. 3). 
During each interval, ratios were highest in shallower (1-1.6 km) cores 
and lowest in deeper (>3 km) cores. The slope of this relationship is 
greater during the Holocene and deglacial than during the glacial, 


a b 
Cores above 2 km 


0 Cores between 2 and 3 km 


owing to greater differences in shallow ratios between these times. 
Although increased particle fluxes could lead to increased scavenging 
of 7°Pa and, thus, higher ratios, our observed average ratio-depth 
relationship is not accompanied by any apparent depth trend in 
*°°Th-normalized sediment mass fluxes (Supplementary Fig. 2); nor 
do particle fluxes seem to be a dominant control on *'Pa,,/7°°Th,, in 
individual cores (Supplementary Fig. 3). 

This depth relationship, which has characterized central Arctic 
231pa. ./7°°Th,, over the past 35 kyr, mirrors a depth relationship seen 
in surface sediments of other Arctic cores above 3.5-km water 
depth'™'>"” (Supplementary Fig. 4). Core-top 7*'Pa,./”°°Th,, mea- 
surements from 3.5-4.5-km water depth in the Eurasian basin’ show 
ratios higher than predicted by our depth trend, although all but one 
still fall below the production ratio. 

The relationship between 31daq,/7°Th,, and water depth that we 
find in the Arctic Ocean echoes patterns found outside the Arctic 
despite major dissimilarities in oceanographic setting. It resembles that 
constrained by ?31Da../°°Th,, surface sediments in other oceans 
including the eastern South Atlantic Ocean”, the North Atlantic 
Ocean*® and the Pacific Ocean”, suggesting that this relationship 
may be global and related to seawater ingrowth and particle scav- 
enging, rather than to local oceanography. The specific factors 
responsible for a relationship between sediment 31D a.</° Thy, and 
water depth remain unclear. Opal scavenges Pa more efficiently than 
do lithogenic or carbonate particles*”* and has been implicated in the 
ratio—depth relationship elsewhere”. Opal contents of samples from 
our Arctic cores (Supplementary Table 4) show little variation and no 
clear relation either to water depth or to 231Daq.,/7°°Th,« in the same 
samples, although this may be due to post-depositional dissolution on 
the sea floor. A simple reversible scavenging-mixing model”® suggests 
that, owing to lower activities of 31D, at shallower depths, less time is 
required for shallower waters to reach a steady state with respect to 
ingrowth and scavenging of **'Pa. Shorter residence times of *°'Pa in 
water bathing shallower core sites may lead to more balanced scav- 
enging between ~*'Pa and **° Th, which typically has a short residence 
time in sea water. 

No sink for the *°’Pa exported from our study sites has yet been 
found in the Arctic. Boundary scavenging, that is, the preferential 
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Figure 2 | 231 Dag | Ths activity ratio records from Arctic box cores. Cores 
are grouped by water depth: <2 km (a), 2-3 km (b), >3 km (c). Age models are 
based on radiocarbon dates; ratios are corrected for decay since deposition. 

Black points with error bars in the upper right corner of each box depict average 
range of 1o uncertainty, calculated from the standard deviation from the mean 
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of replicate 31a. />°Th,., measurements propagated with age uncertainty. 
Calibrated radiocarbon dates are shown at left in each plot (triangles). The 
dashed vertical lines show the production ratio of 7*'Pa/**’ Th in sea water. Grey 
shading indicates points older than 35 kyr. BC, box core. 
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Figure 3 | Average Holocene, deglacial and late glacial 7*'Pa,/**°Th,, 
activity ratios. Ratios shown in Fig. 2 were averaged for each core over the 
Holocene (0-10 kyr ago; a), deglacial (10-15 kyr ago; b) and late glacial (15- 
35 kyr ago; c). The dashed lines indicate a seawater production ratio of 0.093. 
Each core is coloured as in Fig. 2. Ratios decrease nearly linearly with increasing 


removal at higher-particle-flux margins of **'Pa imported from lower- 
particle-flux central basins, has been considered a likely mechanism of 
*'Pa removal from the Arctic water column'"5?’, Models of particle- 
reactive metal transport in the ocean support this proposition and 
suggest that *°°Th may also be subject to boundary scavenging in 
the most particle-poor parts of the Arctic’. Because the existing sedi- 
mentary evidence points to local removal of **°Th from sea water to 
sediments in the Makarov basin™, rather than lateral export of 230TH to 
margins, boundary scavenging of **Pa should lead to **’Pa,./7°° Thy. 
ratios greater than the production ratio at Arctic continental slopes. At 
the Alaska-Chukchi continental slope in the Canada basin, in which 
water residence times are greater than 600 yr (ref. 21), and near ter- 
restrial particle sources, core-top ratios are generally less than the 
production ratio’’ (Supplementary Information), indicating export 
of **'Pa produced at depth’ that is unbalanced within the Arctic 
proper. 

The export of **'Pa out of the Arctic'® could also explain low Central 
Arctic 7*'Pa,,/?°°Th,, ratios. The Fram strait is the only conduit 
through which deep water enters and exits the Arctic, and has a sill 
depth of ~2,500 m. No more than 10% of *°°Th and roughly 39% of 
3'Pa produced in the Arctic Ocean is at present exported in water 
masses exiting southwards through the Fram strait'*'*. Our results are 
consistent with a persistent net export of **'Pa, ranging from 30% in 
the Holocene to 40% during the Last Glacial Maximum (Methods). 
Persistent *°'Pa export is also consistent with continuous glacial inflow 
of warm Atlantic waters reaching intermediate depths during cold 
intervals*®, which may have provided salt and mass balance for this 
deep outflow. Because sedimentary 31D a, 79° Thx integrates over- 
lying influences’, the inferred deepening® of an inflow of low-**'Pa, 
low-7°°Th water, such as that from the Atlantic, might also have an 
impact on the burial of 7*'Pa,,/?°°Th,., especially at shallower and 
intermediate depths. 

Increased **'Pa export during the glacial might also be explained by 
even greater deep-water exchange through the Fram strait than exists 
today, although it more probably reflects longer residence times for 
dissolved Arctic **’Pa before removal by scavenging, and possibly a 
combination of these factors. Indeed, it seems likely that heavy per- 
ennial sea ice in the glacial Arctic led to diminished particle scavenging 
and, hence, an increase in dissolved 7*!Pa concentrations in the water 
column, resulting in higher *°'Pa concentrations in water exiting the 


depth during all three climatic periods investigated, although the gradient of 
this decrease is smaller during the glacial (owing to lower ratios at shallow sites) 
than during the deglacial and Holocene. Error bars, 1 s.e.; r’, coefficient of 
determination; z, water depth. 


Arctic. Low sedimentation rates and *°°Th-normalized sediment mass 
fluxes (Supplementary Information), as well as reductions in metal- 
liferous deposition”’, in glacial-age Arctic sediments are consistent 
with this hypothesis. The ~40% decrease in glacial scavenging inten- 
sity suggested by lower particle fluxes (Supplementary Information) 
could counter the ~33% increase in 7°*'Pa export, to result in a small 
net reduction (<10%) in glacial deep-water outflow through the Fram 
strait, although the uncertainties associated with these estimates are 
too large to resolve a change of this scale. 

The eventual fate of the *'Pa exported from the Arctic is presently 
unknown. Although **'Pa burial could occur farther south in the 
Nordic seas or North Atlantic, the Fram strait, with higher particle 
fluxes due to seasonal productivity, sea-ice transport and nearby ter- 
rigenous particle sources, could act asa likely sink for the *°Pa exported 
from the Arctic. 

The combined observations of a balanced Arctic **°Th budget and 
persistent sedimentary **'Pa,, deficits provide constraints on the time- 
scale of **'Pa removal from the basin. Residence times with respect to 
removal by scavenging for these nuclides in the water column in other 
oceans (10-20 yr for 230TH, 50-200 yr for ?31Da: refs 29, 30) indicate 
that the deep waters of the Arctic must be exchanged through the Fram 
strait on centennial timescales. Especially given the evidence for vari- 
able and possibly longer residence times in sea water for **'Pa and 
*°Th in the Arctic’, any decadal-scale exchange would lead to sedi- 
mentary OTe. deficits that are not observed". Longer, millennial, 
timescales of exchange would not allow the widespread net sediment- 
ary ~*'Pa,, deficit evident in core tops!"5” and our new down-core 
results. Although our evidence for greater glacial export of **'Pa is 
more likely to reflect diminished scavenging than enhanced deep out- 
flow, the persistent outflow from a presumably perennially ice-covered 
Arctic suggests that the same sea ice and brine formation processes as 
today probably occurred in recurring open waters at the basin margins. 
The century-scale deep-water exchange is consistent with a residence 
time for seawater in the modern Arctic of 120-360 yr (Methods), 
assuming a volume of 11.455 X 10°km® and a volume transport 
through the Fram strait of approximately 1-3 Sv (refs 3, 15). It is also 
consistent with estimates of the ‘age’ of deep waters in the Arctic 
basins*°**". Because the most important component of the Fram strait 
exchange for Arctic **'Pa export is the intermediate and deep outflow, 
the results presented here provide a proxy for reconstructing past 
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dynamics of the Arctic Ocean at depth, and indicate that the outflow 
through the Fram strait may have persisted through the very different 
conditions and dramatic climate changes since the most recent glacial. 


METHODS SUMMARY 


Cores were sampled in 0.5-cm slices every 1 or 0.5 cm. Sediments were spiked with 
*3Pa and »°Th for isotope dilution analysis and digested in HCIO,, HF and 
HNO. Aliquots were spiked with **°U and *°Th for measurement of **°U and 
*°2Th. Thorium and Pa fractions were purified by anion-exchange column chro- 
matography and analysed on a Thermo-Finnegan Element 2 inductively coupled 
plasma mass spectrometer”® (ICP-MS). Measurements of ***U and 7**Th were 
used to estimate supported detrital and ingrown **'Pa and **°Th activity. 
Corrections assuming respective detrital **°U/?**Th ratios of 0.5 and 0.7 were 
applied to each sample to calculate a range of corrected 31a, and ~°°Th,, values 
and their ratios’*; these ratio values are reported in Supplementary Table 3, and 
average values based on these ranges are plotted in Fig. 2 and Supplementary Fig. 1. 
Excess values were also corrected for radioactive decay since deposition and 
ingrowth from authigenic U estimated using the same values for **°U/?**Th. 

The uncertainty associated with ICP-MS analysis is estimated to be less than 2% 
for both the **'Pa and the *°Th analysis. The uncertainty in our age models 
produces an uncertainty in age-corrected 31a. f° Thy. ratios of less than 3% 
for age errors up to +2 kyr, incorporating uncertainties in ‘4C analyses, reservoir 
correction and calendar age calibration. The average lo uncertainty in 
1pa../°Th,,, calculated from replicate analyses (Supplementary Table 5) and 
incorporating age uncertainty, was ~8%. 

The data reported here, and tabulated in Supplementary Information, will be 
archived in the US NOAA Paleoclimatology database. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Sampling and isotopic analysis. Cores were sampled in 0.5-cm slices every 1 cm 
(all box cores except 17) or 0.5 cm (box core 17). Sediments were spiked with ***Pa 
and *°Th for isotope dilution analysis and digested in HCIO,, HF and HNO3. 
Aliquots were spiked with 7°°U and *”’Th for measurement of **°U and 7’Th. 
Thorium and Pa fractions were purified by anion-exchange column chromato- 
graphy and analysed on a Thermo-Finnegan Element 2 ICP-MS*". Repeated mea- 
surements of *°U/?*U ratios in the U standard solution NBS960 were used to 
correct for isotope mass fractionation during ICP-MS analysis. Measurements of 
*°8U and **’Th were used to estimate supported detrital and ingrown **'Pa and 
*°°Th activity. Corrections assuming respective detrital **°U/?”’Th ratios of 0.5 
and of 0.7 were applied to each sample to calculate a range of corrected **'Pa,, and 
Th... values and their ratios; these ratio values are reported in Supplementary 
Table 3. Average values based on the corrected ranges are reported in 
Supplementary Table 3 and plotted in Fig. 2 and Supplementary Fig. 1. Excess 
values were corrected for effects of both radioactive decay since deposition and 
authigenic ingrowth. (Published core-top excess ratios from other studies dis- 
cussed in this paper have not been corrected for decay since deposition.) 

The uncertainty associated with ICP-MS analysis is estimated to be less than 2% 

for both the **'Pa and the **°Th analysis. The uncertainty in our age models 
produces an uncertainty in age-corrected **'Pa,,/**°Th,, ratios of less than 3% 
for age errors up to +2 kyr. The average 1o uncertainty in **'Pa,,/?*°Th,, calcu- 
lated from replicate analyses was ~8%. 
Core chronologies. We constructed age models by linear interpolation between 
radiocarbon-dated intervals, using both published dates’*”’ and dates new to this 
study. Radiocarbon ages were converted to calendar years using the CALIB 5.2 
program” and the MARINECAL 04 marine calibration data set”’ for ages less than 
22 kyr, and the calibration from ref. 34 for greater ages*’. A local difference in 
reservoir age of AR = 40yr, to provide a reservoir age of 440 yr, following the 
studies in which a number of the dates used here were published'*!**°, plus an 
additional 250-yr AR (refs 36-38), to account for the likely additional influence of 
sea ice on air-sea gas exchange, were applied to all ages. 

Published radiocarbon dates were measured from samples of the polar plank- 
tonic foraminifera Neogloboquadrina pachyderma sinistral, for box cores 08, 16 
and 17'* (from the same subcores that we sampled for nuclide records). Published 
dates for box core 20’” were measured in a different subcore from that used for 
nuclide sampling. Although stratigraphy can vary by several centimetres between 
subcores of the same box core, our coarse fraction weight per cent record in 
box core20 matches the published record’’ quite well, suggesting little stra- 
tigraphic offset between that subcore and ours. Dates for box cores 28 and 32 
(R. Poore, personal communication) were produced from the same subcores as 
our radionuclide records. Dates from box core 26 (this study) were measured from 
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N. pachyderma sinistral at the NOSAMS facility at Woods Hole Oceanographic 
Institution. 
Additional methods. Biogenic silica contents (Supplementary Table 4) of selected 
samples were measured at the Lamont-Doherty Earth Observatory using standard 
methods”. 

230Th normalized sediment mass fluxes“ to the seafloor were calculated 
according to the equation F = fz/**°Th,,, where f is the production rate of 
0Th in seawater (0.0267 d.p.m.m *kyr~'), z is the water depth of the site (in 
metres) and F is reported in units of gem “kyr. 

Our estimates of persistent Holocene and glacial export of **'Pa from the Arctic 
Ocean as a whole were calculated using estimates of Arctic water volume, at 
0.5-km depth increments, inferred from Arctic basin hyposometry*’, and the 
average ~*!Pa export within each such water depth increment was inferred from 
31pa../?*°Th,, ratios and the production ratio of ?31Ba and *°Th in sea water. The 
residence time of sea water within the Arctic (120-360 yr) was calculated by 
dividing the Arctic basin volume’! (11.455 X 10°km?) by the outflow through 
the Fram strait'® (1-3 Sy). 
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Emergence of two types of terrestrial planet on 
solidification of magma ocean 


Keiko Hamano!, Yutaka Abe! & Hidenori Genda!? 


Understanding the origins of the diversity in terrestrial planets is a 
fundamental goal in Earth and planetary sciences. In the Solar 
System, Venus has a similar size and bulk composition to those 
of Earth, but it lacks water’ *. Because a richer variety of exoplanets 
is expected to be discovered, prediction of their atmospheres and 
surface environments requires a general framework for planetary 
evolution. Here we show that terrestrial planets can be divided into 
two distinct types on the basis of their evolutionary history during 
solidification from the initially hot molten state expected from the 
standard formation model*”. Even if, apart from their orbits, they 
were identical just after formation, the solidified planets can have 
different characteristics. A typeI planet, which is formed beyond a 
certain critical distance from the host star, solidifies within several 
million years. If the planet acquires water during formation, most 
of this water is retained and forms the earliest oceans. In contrast, 
on a type II planet, which is formed inside the critical distance, a 
magma ocean can be sustained for longer, even with a larger initial 
amount of water. Its duration could be as long as 100 million years 
if the planet is formed together with a mass of water comparable to 
the total inventory of the modern Earth. Hydrodynamic escape 
desiccates typeII planets during the slow solidification process. 
Although Earth is categorized as typeI, it is not clear which type 
Venus is because its orbital distance is close to the critical distance. 
However, because the dryness of the surface and mantle predicted 
for type II planets is consistent with the characteristics of Venus, it 
may be representative of type II planets. Also, future observations 
may have a chance to detect not only terrestrial exoplanets covered 
with water ocean but also those covered with magma ocean around 
a young star. 

Theoretical studies on planet formation suggest that Earth-sized 
planets (Earth and Venus) should form as a result of giant impacts 
between protoplanets*, and probably all start their lives in a globally 
molten state’. The earliest phase of planetary evolution is thus solidi- 
fication of a magma ocean, which provides the initial conditions for 
mantle differentiation and distribution of volatiles between the interior 
and the surface°. The timing of the end of this phase also determines 
the starting point for subsequent events such as water ocean formation, 
and possibly the onset of plate tectonics and the development of life. 

The thermal evolution of a magma ocean is closely linked to the 
formation of a steam atmosphere®*. A massive steam atmosphere 
decreases outgoing radiation from the planet through its strong green- 
house effects, and thus delays the solidification process. Degassing 
from a solidifying magma ocean can in turn greatly increase the 
amount of the steam atmosphere’. Therefore, evaluating the outgoing 
radiation during atmospheric evolution is the key to understanding the 
earliest thermal history of the planet. 

Particularly for a hot steam atmosphere above a magma ocean, 
previous studies have shown that the outgoing radiation has a lower 
limit (about 300 W m ”) owing to saturation of the upper troposphere 
with water vapour’ ''. This is because its thermal structure is uniquely 
determined by the saturated-vapour-pressure curve of water (see 


ref. 11 for a further explanation). Because the net incoming stellar 
radiation in the zone of formation of terrestrial planets can be smaller 
or larger than this radiation limit, it can be speculated from heat 
balance that the presence of such a limit would produce an evolution- 
ary dichotomy between terrestrial planets. The effect of the radiation 
limit, however, was neglected in a previously proposed coupled model 
for a deep magma ocean and steam atmosphere’. In addition, the 
greenhouse effect of water vapour has not been fully considered. 
Here we perform radiative-convective equilibrium calculations to 
enable our model to be used to calculate the radiation limit for a satu- 
rated steam atmosphere. In addition we take into account the water loss 
associated with hydrodynamic escape"’, which is expected to occur in 
parallel with magma-ocean solidification. Competition between degass- 
ing and water loss determines whether the steam atmosphere grows or 
escapes. This would affect not only the thermal evolution of planets but 
also the planetary water inventory at the time of complete solidification. 
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Figure 1 | Typical evolution of a type I planet. Evolution ofa planet located at 
1 AU, with an initial water mass of five times the current ocean mass on Earth 
(Mgo, 1.4 X 10°' kg). The grey dotted lines in the bottom panel indicate the 
radiation limits''. At ~0.7 Myr, the planetary radiation reaches the 
tropospheric radiation limit. After that, the heat flux from the magma ocean 
(MO) becomes constant. This results in rapid solidification at ~4 Myr. The 
solidification time and final water partitioning are comparable to those 
reported in ref. 6 for a MO 2,000 km deep on Earth and a total water mass of 
~10Mgo. 
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Figure 1 shows the typical evolution of an Earth-sized planet located 
at a distance of 1 Au from its parent star. Because of the high solubility 
of water in silicate melts, a sizeable fraction of the water dissolves in the 
initial deep magma ocean. As the magma solidifies, degassing of water 
from the interior leads to an increase in the atmospheric mass. At 
about 0.2 million years (Myr), because of the growth of the atmosphere 
combined with the decrease in temperature, the atmosphere starts to 
be saturated with water vapour at the tropopause. As the saturation 
front moves deeper into the troposphere, the planetary radiation 
decreases until it finally becomes equal to the tropospheric radiation 
limit at about 0.7 Myr. The outgoing radiation flux then remains con- 
stant throughout the subsequent solidification period. 

The overall solidification time is about 4 Myr in this case, and is 
mainly determined by the minimum net heat flux; this is defined as the 
difference between the radiation limit and the net incident stellar 
radiation. The existence of a minimum heat flux after saturation gives 
rise to rapid solidification because further atmospheric growth has no 
effect on the cooling rate of the planet. Hydrodynamic escape makes 
little contribution to the water inventory during the short magma- 
ocean period. We refer to this type of planet as typeI. 

This mechanism does not apply to the evolution ofa planet at 0.7 au 
(Fig. 2), in which the net incident stellar radiation exceeds the tro- 
pospheric radiation limit. In the earlier stages up to ~0.2 Myr, its 
evolution is similar to that of a typeI planet. However, at ~1 Myr 
the outgoing radiation almost balances the net incident stellar radi- 
ation before a sufficiently deep part of the troposphere becomes satu- 
rated. This results in an extremely low heat flux so that the magma 
ocean is sustained for about 100 Myr, much longer than the value of 
4 Myr at 1 Au, for an initial water amount of five times the current 
ocean mass on Earth (5Mgo). During this period there is a large 
decrease in the total water inventory of the planet. 

The solidification rate is mainly governed by the water loss rate due 
to hydrodynamic escape. Because the outgoing radiation must exceed 
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Figure 2 | Typical evolution of a type II planet. As in Fig. 1, but for an orbital 
distance of 0.7 Au. The net incident stellar radiation is larger than the 
tropospheric radiation limit. At ~1 Myr, planetary radiation almost balances 
the net stellar radiation. The subsequent heat flux from the MO is controlled by 
the rate of water loss. The dashed black line in the top panel represents the 
upper limit of atmospheric pressure for cooling, for which the planetary 
radiation is equal to the net stellar flux. The MO period is ~100 Myr, much 
longer than that for a typeI planet. 
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the incident stellar radiation for cooling, the steam atmosphere must 
be optically thinner for lower surface temperatures (Fig. 2), so that a 
net loss of the steam atmosphere is required. We refer to this type of 
planet as type II. 

These two types of planet also have a different partitioning of water 
between the atmosphere and the interior (Fig. 3). For the type I planet 
at 1 Au, the total water inventory becomes less susceptible to loss for 
larger initial water masses. The primary water reservoir is the steam 
atmosphere, whereas the deep interior accounts for only a small per- 
centage of the planetary water inventory. In contrast, the type II planet 
at 0.7 Au becomes desiccated during the longer magma-ocean period. 
The final total water inventory is less than 0.1Mgo even for an initial 
value of 10Mgo. In addition, as the initial water inventory increases, 
the interior reservoir becomes larger than that of the atmosphere. 

Figure 4 shows the solidification time and final total water inventory 
for Earth-sized planets within the terrestrial planet formation zone. 
The solidification time (see Supplementary Information for analytical 
expressions) is seen to peak at about 0.8 Au. Planets located beyond this 
range are classified as type I, whereas those inside it are classified as 
type II. TypeI planets have maximum solidification times typically as 
short as several million years owing to the minimum heat flux. Because 
only a limited amount of hydrodynamic escape can occur within such 
a short period, it has a reduced impact on the thermal history and 
water budget of the planet. Soon after solidification, water oceans 
would probably form on the surface. 

For a type I planet, the initial total water inventory simply affects the 
volume of the earliest oceans, whereas it has a strong influence on the 
solidification time for a type II planet. As seen in Fig. 4, the duration of 
the magma-ocean phase agrees well with the time required for total loss 
of the primordial water. This reflects the fact that the type II planet must 
lose water to cool down, and consequently a larger water endowment 
results in a longer magma-ocean period. The final total water inventory 
of type II planets never exceeds 0.1 Mgo; irrespective of the initial value. 
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Figure 3 | Water partitioning between steam atmosphere and planetary 
interior. The amount of water depends on the rate of loss and the duration of 
the MO period. a, Planet at 1 Au (typeI). Most of the primordial water remains 
and contributes to the steam atmosphere at the time of complete solidification. 
b, Planet at 0.7 Au (type II). The final total mass of water is less than 0.1Mgo. 
The planetary interior may become the dominant reservoir if water is 
incorporated before the silicate melts become desiccated. The effect of a surface 
thermal boundary layer is discussed in Supplementary Information. 
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Figure 4 | Two distinct types of terrestrial planet. The upper x axis shows the 
corresponding initial net stellar radiation. The arrow indicates the tropospheric 
radiation limit. The critical orbital distance a., of ~0.76 AU separates the orbital 
regimes of the two types of planet. a, Solidification time. The dotted lines show 
the time required for complete loss of primordial water. This provides a good 
approximation of the solidification time of type II planets. The maximum 
solidification time for typeI planets is also shown (see Supplementary 
Information). b, Total water inventory at the time of complete solidification. A 
strong transition is exhibited at about a... 


The different mechanisms determining the solidification rate thus also 
lead to a sharp transition in the planetary water budget. 

Whether a type I or a type II planet is formed around a star depends 
on the critical orbital distance a,,, which is defined as the distance at 
which the net stellar flux that the planet receives equals the tro- 
pospheric radiation limit F;;,,, of the steam atmosphere, 


e.~0.76(—ttim_ “1 (Sai(t0)\ 7 (1a) AU (1) 
“ ""~\294 W m-? 0.7So 1—0.3 


where S,i(To) is the stellar constant at a stellar age t) when solidifica- 
tion starts. So is the current solar constant and « is the planetary 
albedo. a,,, which is about 0.76 in our model, can range from about 
0.6 to 0.8 AU, as a result of parameter uncertainties and the atmospheric 
model used. 

Earth is located at a distance of 1 AU from the Sun, which is suffi- 
ciently far for it to be a typeI planet. Its deep interior would probably 
have solidified within about 5 Myr, even if it had acquired as much or 
more water than the current total inventory (~1.25-5Mgo)"’. This 
would support the rapid solidification and ocean formation condition 
suggested from geochemical data'*'*. The overall water budget would 
have been only slightly affected by hydrodynamic escape, but this mod- 
est loss might still have contributed to oxidization of the interior through 
selective escape of hydrogen over oxygen. Ifa simple assumption is made 
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that all of the dissociated oxygen atoms were consumed by oxidation of 
ferrous iron, the bulk Fe,0; content in the early mantle would have 
increased by up to about 0.07wt% during the first several million years 
of Earth’s history. This value is comparable to the current Fe,O; content 
in the upper mantle (>0.2wt%)'° and is consistent with the idea of an 
oxidized mantle possibly as early as 4.4 Gyr ago (ref. 17). 

Because Venus’s orbital distance of about 0.72 au places it on the 
border zone, it is not exactly clear which planetary type it should be 
classified as. However, the significant water loss for a type II planet 
during the magma-ocean period might explain its dry interior and the 
fate of the remnant oxygen. The absence of water on its surface’ and 
interior*” is conventionally explained in terms of an inherently dry 
origin'® or hydrodynamic escape after ocean evaporation”’. The latter 
situation corresponds to the case in which Venus is a typeI planet in 
our classification. In this case, the dryness of its interior depends on 
degassing efficiency during the subsequent evolution. Alternatively, if 
Venus belongs to type II, the occurrence of planetary desiccation dur- 
ing solidification can explain the dryness of its current mantle””, even if 
it accreted with a certain amount of water. The presence of a dry 
mantle during the earliest stages might also have led to a subsequent 
tectonic evolution different from that of Earth”®’! and possibly to the 
absence of a magnetic field”. 

One problem associated with the hydrodynamic escape of water is 
the accumulation of dissociated oxygen in the atmosphere as a result of 
the preferential escape of hydrogen”’, which would possibly lead to a 
shutdown of the hydrogen escape process itself. However, if Venus 
belongs to type II, the remnant oxygen issue can be avoided, regardless 
of the initial amount of water, because the abundant surface magma 
can act as a massive oxygen sink. Although the effects of other gaseous 
species such as carbon dioxide will need to be addressed in future work, 
the slight difference in the orbital distances of Earth and Venus might 
have been the critical factor making these planets totally different after 
solidification. 

Although we have so far discussed only Earth-sized planets orbiting 
a Sun-like star, the conclusions can be qualitatively extended to all 
terrestrial planets that experienced global melting through giant 
impact. More than 750 exoplanets have already been confirmed”, 
and 2,300 candidates** have been identified. Future space missions 
such as the Terrestrial Planet Finder and Darwin will provide us with 
a catalogue of atmospheres and surface environments for terrestrial 
planets of various ages. The present results indicate that for habitable 
planets, rapid ocean formation would have occurred within several 
million years of planet formation. In addition, the prediction that 
type II planets undergo a long-lived (~100 Myr) magma-ocean stage 
suggests that, apart from permanently molten planets very close to the 
host star, there may be a chance of detecting still-molten terrestrial 
planets as a consequence of planetary formation, especially around 
young stars. Although the molten surface would be hard to detect 
through the thick atmosphere, it can in principle be identified by 
detecting gaseous species such as NaOH and KOH that are indicative 
of hot atmospheres”®. 


METHODS SUMMARY 

Water reservoirs and budgets. Partition of water between planetary reservoirs 
was treated in basically the same way as in ref. 6. We used water solubility in 
basaltic-composition melts calculated from a solubility model in ref. 27. Water 
partition coefficient into cumulates and mass fraction of interstitially trapped 
melts were fixed at 10 * and 0.01, respectively. 

The water-loss rate was calculated from energy-limited escape rate of hydro- 
gen’? with a heating efficiency of 0.1 by assuming that all the dissociated oxygen 
would have been absorbed into the magma ocean. For the effects of the higher 
heating efficiency, see the online-only Methods and Supplementary Information 
section 4. We adopted an observational compilation of Sun-like stars for time- 
dependent extreme ultraviolet radiation”*. 

Thermal evolution model. Assuming that the atmosphere has a negligible heat 
capacity, the thermal evolution model was formulated on the basis of heat balance: 
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dT 
Cp pil Tm) “dt. —— 4nR? (Foi —Fwu) (2) 


where C,, »i(Tm) is the heat capacity of the planet at potential temperature T,,, of the 
interior, and R is the planetary radius. F,, and F,, are the outgoing planetary 
radiation and the incoming net stellar radiation, respectively. Calculation starts 
from temperature of 3,000 K, which corresponds to a mostly molten state of the 
mantle, and stops when it reaches the solidus temperature at the surface 
(~1,370 K). We assumed that the whole silicate portion could transfer heat by 
convection and therefore contribute to the heat capacity. We obtained the heat 
capacity, including latent heat of fusion, as a function of potential temperature, 
from its adiabatic temperature profile calculated according to ref. 29. 

The planetary radiation was calculated with a modified radiative-convective 
equilibrium model of grey atmosphere’’. We also performed non-grey calculations 
(Supplementary Information). We adopted a solar standard model for luminosity 
enhancement with stellar age’®. A planetary albedo of 0.3 was assumed for the net 
stellar radiation. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Water reservoirs and water budget. The partitioning of the total water inventory 
between planetary reservoirs was treated in basically the same way as in ref. 6. 
Three water reservoirs were considered: atmosphere, magma ocean (a wholly or 
partly molten region) and solid mantle. The partial pressure of water was calcu- 
lated at each time step by assuming a solution equilibrium between the atmosphere 
and the silicate melts at the surface. The water solubility values used were for 
basaltic melts taken from ref. 31 and were calculated by using the model reported 
in ref. 27 for a temperature of 2,000 K (see Supplementary Information for the 
effect of this degassing assumption). 

For the incorporation of water into solidifying silicates, two processes were 
considered: interstitial trapping of water-enriched melts and water partitioning 
between silicate melts and cumulates. The mass fraction of interstitially trapped 
melts was assumed to be 1%, as in ref. 6. Although the actual value for a solidifying 
magma ocean is highly uncertain, a value of 1% is likely to be an overestimate, 
taking into account the degree of melting estimated for abyssal peridotites when 
melt separation begins**. Few experimental results have been reported on the 
partition coefficient of water for minerals such as perovskites that make up the 
deep mantle. We adopted a water partition coefficient of 10~* in our calculations. 
However, it should be noted that increasing this to 10 7 increases the amount of 
water in the interior at the time of complete solidification by only a factor of 2. This 
is because interstitial trapping of water-enriched melts dominates. 

Because all surface water is in the vapour phase on a solidifying planet, intense 

hydrodynamic escape of hydrogen is expected to occur during the magma ocean 
period. We also considered a reduction in the total amount of water due to 
hydrodynamic escape. The water loss rate was calculated from the energy-limited 
escape rate for hydrogen’ by assuming that all of the oxygen produced by dis- 
sociation of water vapour would be lost from the atmosphere by oxidation of the 
magma ocean. Extreme ultraviolet radiation levels for the host star were based on 
compiled observational data for Sun-like stars, taken from ref. 28, in which the 
decay with time is expressed as a power-law function of stellar age. A heating 
efficiency of 0.1 was used throughout all the calculations. Using the higher heating 
efficiency has less effect on the evolution of typeI planets, whereas the solidifica- 
tion time of type II planets becomes shorter in inverse proportion to the heating 
efficiency (see Supplementary Information). 
Thermal evolution model. A high degree of melting implies an extremely tur- 
bulent magma ocean with a viscosity probably as low as 0.1 Pa (ref. 33). This 
suggests that the magma ocean is effective in transporting internal heat to the 
surface. Planetary radiation into space would therefore be the main factor limiting 
the heat flux from the deep magma ocean. We assumed that the heat flux is given 
by the difference between the outgoing planetary radiation F,; and incoming net 
stellar radiation F,,. On the assumption that the atmosphere has a negligible heat 
capacity, a thermal evolution model can be formulated on the basis of the heat 
balance for a planet, 


dT 
Cppi(Tm) Ge = —4nR? (Fy — Feu) (2) 


where C,, (Tm) is the heat capacity of a planet with an interior potential temper- 
ature T,,, and R is the planetary radius. Heat flux from the core and heat produc- 
tion by radiogenic elements are neglected. The simple assumption is made that the 
potential temperature matches the surface temperature T,—that is, T,, = T,— 
during the solidification period of the magma ocean. The calculation starts with 
potential and surface temperatures of 3,000 K, implying a mostly molten mantle, 
and stops when the solidus temperature at the surface (~1,370 K) is reached. 
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We calculated the planetary heat capacity, including the latent heat of fusion, as 

a function of the potential temperature by assuming that the entire silicate portion 
can transfer heat by convection, and not only the molten part as considered in 
ref. 6. The heat capacity calculated in this way corresponds to the maximum value, 
thus providing a conservative estimate of the maximum solidification time for a 
typeI planet. However, both assumptions about the interior temperature profile 
result in an almost identical solidification time for a type II planet, because the 
solidification rate is mainly determined not by the heat capacity but by the water 
loss rate. The adiabatic thermal structure in the interior was calculated according 
to ref. 29 by using the parameters therein, except for recent experimental data 
for the solidus and liquidus temperatures at great depth (see Supplementary 
Information). Differentiation of the mantle and its mineralogy on solidification 
were not considered in the calculations of the temperature profile, because these 
would be expected to make a smaller contribution to the overall heat capacity of 
the planet. See the next section for calculations of planetary radiation and net 
stellar radiation. 
Atmospheric model. Planetary radiation was calculated by using a modified 
version of the radiative-convective equilibrium model in ref. 11, which was ori- 
ginally designed for a steam atmosphere saturated with water vapour and for 
moderate surface temperatures. However, in the present study the surface tem- 
peratures considered are extremely high, exceeding the critical temperature of 
water. In addition, the relative humidity of the atmosphere can vary because the 
partial pressure of water is determined by solution equilibrium at the surface. We 
have extended the grey atmosphere model in ref. 11 to be applicable to the con- 
ditions considered herein. The critical point of water was simply treated as the 
temperature above which no water condensation would occur. For atmospheric 
layers that were unsaturated by water vapour or for which the temperature was 
above the critical point, the temperature lapse rate in the troposphere was assumed 
to be a dry adiabatic temperature gradient, whereas for saturated parcels the lapse 
rate was assumed to be a pseudo-moist adiabatic temperature gradient, as in 
ref. 11. The atmosphere is considered to consist of 1 bar of N, in addition to water 
vapour, the amount of which is determined by the water budget. See 
Supplementary Information for results calculated with a non-grey atmospheric 
model in ref. 9, instead of our simple grey model. 

For calculations of the net stellar radiation, the solar standard model*’ was 
adopted to describe the change in stellar luminosity with time. The planetary 
albedo was fixed at 0.3 in all calculations. The actual value of planetary albedo is 
highly uncertain, especially because of the unpredictability of cloud coverage with 
a one-dimensional atmospheric model. In the previous studies (refs 9, 10) a 
planetary albedo for a thick steam atmosphere was estimated as large as 0.4-0.5, 
whereas a recent study™* with a non-grey atmospheric model using the HITEMP 
database has shown that including weak HO lines significantly lowers the plan- 
etary albedo. 
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Palaeontological evidence for an Oligocene 
divergence between Old World monkeys and apes 
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Apes and Old World monkeys are prominent components of modern 
African and Asian ecosystems, yet the earliest phases of their evolutio- 
nary history have remained largely undocumented’. The absence of 
crown catarrhine fossils older than ~20 million years (Myr) has stood 
in stark contrast to molecular divergence estimates of ~25-30 Myr 
for the split between Cercopithecoidea (Old World monkeys) and 
Hominoidea (apes), implying long ghost lineages for both clades**. 
Here we describe the oldest known fossil ‘ape’, represented by a 
partial mandible preserving dental features that place it with ‘nyan- 
zapithecine’ stem hominoids. Additionally, we report the oldest stem 
member of the Old World monkey clade, represented by a lower 
third molar. Both specimens were recovered from a precisely dated 
25.2-Myr-old stratum in the Rukwa Rift, a segment of the western 
branch of the East African Rift in Tanzania. These finds extend the fossil 
record of apes and Old World monkeys well into the Oligocene epoch of 
Africa, suggesting a possible link between diversification of crown cat- 
arrhines and changes in the African landscape brought about by previ- 
ously unrecognized tectonic activity’ in the East African rift system. 

The late Oligocene represents the least-sampled temporal interval in 
primate evolutionary history’, with only a handful of primates described 
from all of Afro-Arabia® '°. Possible reasons for an end-Palaeogene gap 
in the fossil record include limited deposits of appropriate age, particularly 
from Africa below the equator, complicated by densely vegetated topo- 
graphy in more tropical environments. Asa result of this sampling bias, 
detailed understanding of the early diversification of Old World monkeys 
and apes has remained elusive. In particular, fossils from this interval are 
critical for testing the hypothesis of a late Palaeogene (~25-30 Myr) 
hominoid-cercopithecoid divergence, a result repeatedly retrieved by 
molecular studies**. Recent discoveries from the Rukwa Rift Basin in 
southwestern Tanzania provide critical data for testing these ideas by 
revealing a novel glimpse into late Oligocene terrestrial ecosystems 
from Africa below the equator’®’. 

The Rukwa Rift Basin (Fig. 1) records one of the thickest accumula- 
tions of sedimentary rock in the entire East African rift system (EARS)". 
Work over the past decade has documented a complex and long-lived 
history of the western branch of the EARS*"' containing continental 
faunas of both Cretaceous’ and Oligocene” age, the latter spanning ~24- 
26 Myr (see Supplementary Information, section 1). Palaeontological 
field research in 2011-12 resulted in the recovery of two well-preserved 
primate fossils from the Nsungwe 2B locality. These discoveries provide 
critical data for resolving disparities between molecularly derived diver- 
gence estimates and the primate fossil record. 


Primates Linnaeus, 1758 
Anthropoidea Mivart, 1864 
Catarrhini Geoffroy, 1812 
Cercopithecoidea Gray, 1821 
Nsungwepithecus gen. nov. 


Etymology. Prefix “Nsungwe’ in reference to the name of the geo- 
logical formation from which the specimen was recovered; ‘pithecus’ 
a common primate suffix derived from the Greek pithekos (ape). 
Diagnosis. Differs from all other fossil cercopithecoids in exhibiting the 
following combination of features on the lower third molar (M3): larger 
than all known ‘victoriapithecids’ other than Zaltanpithecus*"*; unbi- 
furcated mesial root; low rounded cusps with pronounced buccal flare; 
hypoconulid pronounced and centrally positioned; deeply incised distal 
buccal cleft that extends to the crown base; incomplete bilophodonty, 
with a small notch in the lophid connecting the protoconid and meta- 
conid and absence of a hypolophid connecting the entoconid and 
hypoconid; lower degree of mesiodistal elongation and basal inflation 
than in Noropithecus'*; marked buccal enamel wrinkling extending 
onto the median buccal ridge; shallow and crenulated lingual notch; 
and a proliferation of accessory cuspulids along the postmetacristid 
and around the entoconid, including three cuspulids situated in the 
talonid basin. 
Nsungwepithecus gunnelli sp. nov. 

Etymology. Specific name is in honour of Gregg F. Gunnell for his 
many contributions to primate palaeontology. 
Holotype. RRBP (Rukwa Rift Basin Project) 11178, left partial mandible 
preserving M; (Fig. 2b; see also Supplementary Information, section 2). 
Locality and horizon. Oligocene Nsungwe Formation, locality Nsungwe 
2B, near the town of Mbeya, southwestern Tanzania (Fig. 1a). The site 
is situated 30 m above the contact between the Utengule and Songwe 
members of the Nsungwe Formation. The age of the fossil-bearing unit 
is tightly constrained between two volcanic tuffs dated by U-Pb CA- 
TIMS (U-Pb chemical abrasion thermal ionization mass spectrometry) 
geochronology at 25.237 and 25.214 Myr ago (Fig. 1c and Supplemen- 
tary Information). 
Diagnosis. As for genus. For additional description and metrics, see 
Supplementary Information. 

Hominoidea Gray, 1825 

Rukwapithecus gen. nov. 
Etymology. Prefix ‘Rukwa’ in reference to the Rukwa Rift Basin from 
which the specimen was recovered’; ‘pithecus’ a common primate 
suffix derived from the Greek pithekos (ape). 
Diagnosis. Differs from all other extinct catarrhines in the following 
combination of characters: long, high-crowned and obliquely implanted 
lower fourth premolar, with mesial basin elevated high above the longer 
and more distolingually oriented talonid basin; mesiodistally elongate 
and crenulated lower molars that increase in length and width distally, 
and that are subrectangular and waisted in outline; mesial position of 
lower molar protoconid and hypoconid relative to lingual cusps; deep 
lower molar hypoflexid formed in part bya pronounced buccal cingulid 
that surrounds the protoconid mesially, but blends onto the buccal 
surface of the hypoconid; small lower molar metastylid (=mesoconid") 
distolingual to the metaconid that is more pronounced on M,-M3; 
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* Nsungwe 
River section 


Figure 1 | Location and stratigraphy of the primate-bearing locality 
(Nsungwe 2B), in southwestern Tanzania. a, Digital elevation model of the 
study area based on Shuttle Radar Topography Mission data (SRTM). Inset 
map highlights the position of the Rukwa Rift Basin in eastern Africa (yellow 
oval). b, Measured stratigraphic section through the Nsungwe Formation, 
showing the position of Nsungwe 2B in yellow with the positions of two 


accessory cuspules in lingual notches of M; and M3; deep lingual notch 
and distolingual fovea on M,-M3; large, well-individuated and buccally 
positioned hypoconulid on M,-M;; cresting between entoconid and 
hypoconulid weak or absent on M,-M3; and M; massive and highly 
crenulated, slightly tapering to a broad and rounded distal margin. 


Rukwapithecus fleaglei sp. nov. 


Etymology. Specific name is in honour of John G. Fleagle, for his many 
contributions to the study of primate morphology, behaviour and 
evolution. 

Holotype. RRBP 12444<A, a right mandible bearing lower fourth premolar 
(P4) through to M; and the ascending ramus (Fig. 2i and Supplemen- 
tary Information, section 3). 

Locality and horizon. Oligocene Nsungwe Formation, locality Nsungwe 
2B (as described above). 

Diagnosis. As for genus. For additional description and metrics, see 
Supplementary Information. 

Morphological features defining the earliest crown catarrhines 
have largely been a matter of speculation, with victoriapithecids and 
proconsuloids often defined primarily by the shared retention of prim- 
itive characters relative to later forms, rather than by demonstrable 
synapomorphies'*""”. Poor resolution of the phylogenetic branching 
pattern among fossil forms near the base of the cercopithecoid—hominoid 
split is further complicated by an unbalanced fossil record for the two 
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recently dated (via U-Pb CA-TIMS; bold type) carbonatite tuffs and several 
other dated tuffs (left), interpreted palaeomagnetic reversal stratigraphy of 
ref. 5 (virtual geomagnetic pole latitude (VGP lat.)) (centre), and ages (Myr 
ago; right) derived from the global polarity timescale (GPTS). Black bars, 
normal polarity; white bars, reversed polarity (see Supplementary Information 
for additional geological details). 


groups, with early cercopithecoids represented by relatively few early- 
mid Miocene taxa’* and Miocene apes exhibiting greater taxonomic 
diversity’* . The presence of taxa as distinctive as Prohylobates, Proconsul 
and Rangwapithecus in the African early Miocene indicates that the 
cercopithecoid—hominoid diversification initiated during the Oligocene’*”’, 
yet the only previously described late Oligocene catarrhines, Kamoyapithecus 
and Saadanius, are generally regarded as stem forms rather than mem- 
bers of the catarrhine crown clade**!”. As such, Rukwapithecus and 
Nsungwepithecus are the first described primates that document the 
presence of crown catarrhines as early as ~25 Myr ago. 
Nsungwepithecus represents the first cercopithecoid old enough to 
confirm the late Palaeogene crown catarrhine divergence estimates 
derived from molecular studies**. Nsungwepithecus shares with vic- 
toriapithecids numerous features of lower molar morphology includ- 
ing deeply incised buccal clefts, a high degree of buccal flare, and 
the lack of a buccal cingulid. Before the late Miocene, the published 
cercopithecoid record has largely been limited to rare and incomplete 
materials of Prohylobates and Zaltanpithecus collected from the early- 
middle Miocene of northern Africa'*"*'*, together with a spectacular 
array of over 2,500 specimens from a single taxon (Victoriapithecus) 
from mid-Miocene deposits in eastern Africa'*’®'°”", Additional cerco- 
pithecoid diversity has recently been recognized in early—middle 
Miocene faunas from Kenya and Uganda, represented by as many as 
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Figure 2 | Comparison of Nsungwe Formation primates with representative 
stem and crown catarrhines. a, Propliopithecus sp. (TQ 4, early Oligocene of 
Oman), right M;, reversed for comparison; b, Nsungwepithecus gunnelli gen. et 
sp. nov. (RRBP 11178, late Oligocene of Tanzania), left M,; c, Noropithecus 

bulukensis (KNM-WS 12642, early Miocene of Kenya), right M3, reversed for 
comparison; d, Victoriapithecus macinnesi (KNM-MB 18993, middle Miocene 
of Kenya), right M3, reversed for comparison; e, Propliopithecus haeckeli (SMN 
12638, early Oligocene (?) of Egypt), right P,-M; f, Aegyptopithecus zeuxis 


three species within the bilophodont genus Noropithecus"*. Collectively, 
these fossils have been grouped in the Victoriapithecidae (a basal cer- 
copithecoid group), with the acknowledgement that additional, more 
complete, fossils may reveal this to be a paraphyletic assemblage’. The 
recovery of a stem cercopithecoid older than 25 Myr ago significantly 
extends the record of this clade, documenting the presence of a rela- 
tively large and incompletely bilophodont monkey in the Oligocene of 
eastern Africa. 
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(DPC 3056, early Oligocene of Egypt), right P;-M3; g, Kalepithecus 
songhorensis holotype (KNM-SO 378, early Miocene of Kenya), right Pys—M3; 
h, Victoriapithecus macinnesi (KNM-MB 18993, middle Miocene of Kenya), 
right M,—-M; and reversed left P4; i, Rukwapithecus fleaglei gen. et sp. nov. 
(RRBP 12444A, late Oligocene of Tanzania), right Py-M; j, Rangwapithecus 
gordoni (KNM-SO 463, early Miocene of Kenya), right M,—M3. See 
Supplementary Information section 4 for imaging protocols, and 
Supplementary Videos 1 and 2 for additional views of Nsungwe specimens. 


Rukwapithecus shares two features with the Miocene apes and 
extant hominoids that are not present in cercopithecoids or Fayum 
stem catarrhines'*'®: buccal position of the M, hypoconulid, and 
mesial migration of cusps on the buccal side of lower molars such that 
the hypoconid is positioned opposite the lingual notch between the 
metaconid and the entoconid. In particular, Rukwapithecus shares 
numerous features with the early Miocene Rangwapithecus'*”*?”*, 
including cusp position and wear pattern, degree of crenulation and 
cingulid development, oblique orientation of the cristid obliqua on M; 
and M,, deep hypoflexid, deep distolingual fovea, and an enlarged M3. 


Figure 3 | Phylogenetic placement of Rukwapithecus, new genus. 
Temporally calibrated Adams consensus of four equally parsimonious trees 
recovered in PAUP 4.0b10 and the ‘allcompat’ tree calculated in MrBayes 3.2, 
based on analysis of the modified and taxonomically expanded 191-character 
morphological character matrix of ref. 26 (see Supplementary Information for 
details; thick dashed lines indicate branches that are not present in the strict 
consensus of all five trees; asterisks next to support values indicate that that 
node was constrained in either the parsimony or Bayesian analysis). Eocene 
Catopithecus was constrained as the most basal stem catarrhine due to its 
retention of several plesiomorphies that demonstrably evolved convergently in 
later catarrhines and the extant platyrrhine outgroups’’. Both analyses 
congruently placed Rukwapithecus as a nyanzapithecine stem hominoid, and, 
within “‘Nyanzapithecinae’, as the sister taxon of early Miocene 
Rangwapithecus. We obtained the same placement of Rukwapithecus when 
Lomorupithecus was removed and scorings for Afropithecus and 
Morotopithecus were combined into a single operational taxonomic unit, 
following an alternative taxonomic hypothesis of ref. 15. Numbers above and 
below branches are Bayesian posterior probability values and bootstrap values, 
respectively. We place quotation marks around “‘Nyanzapithecinae’ because 
reports of more complete materials of Mabokopithecus may eventually 
necessitate reassignment of some or all Nyanzapithecus species to the former 
genus'*” due to taxonomic priority’’, and may result in a name change for 
‘Nyanzapithecinae’. The new genus Nsungwepithecus was not included in the 
phylogenetic analyses, and is grafted onto the tree in its proposed placement as 
the most basal known stem cercopithecoid. Unambiguous synapomorphies for 
nodes numbered 1-5 are provided in the Supplementary Information. 
Divergence dates within crown Cercopithecoidea and crown Hominoidea are 
based on the molecular dating analysis of ref. 4 with independent rates and soft 
bounds. 
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Such features suggest that the lower molar pattern in the Rukwapithecus- 
Rangwapithecus clade was fairly conserved across the Oligocene-Miocene 
transition. Rukwapithecus nevertheless differs from Rangwapithecus in 
a number of features, exhibiting for example a narrower mesial fovea, a 
less tapered distal margin of M3, and relatively weak cresting between 
the lower molar entoconid and hypoconulid. Indeed, bootstrap support 
for the Rangwapithecus—Rukwapithecus clade in our parsimony analyses 
is very low (<50%), arguing against a particularly close (genus level) 
relationship (Fig. 3 and Supplementary Information, sections 5-7). 
Parsimony and Bayesian phylogenetic analyses**”” place Rukwapithecus 
as a stem hominoid nested within the ‘nyanzapithecine’ clade (sensu 
Harrison'’, see Fig. 3), but this result is not particularly robust (see 
Fig. 3 support values and Supplementary Information). In light of this, 
we cautiously place Rukwapithecus in both ‘Nyanzapithecinae’ and 
Hominoidea, but recognize that additional data from other parts of 
the dentition, cranium and postcranium are necessary for further test- 
ing and refining these hypotheses. 

The Cenozoic era of Africa records a remarkable and deep record 
of environmental change. During this time, tectonic activity in the 
prominent East African rift system’, uplift of the African plateau®’*, 
and climate aridification® had profound implications for Africa’s res- 
ident biota. Near the Oligocene-Miocene boundary, collision between 
the Afro-Arabian and Eurasian landmasses initiated periodic faunal 
interchange that contributed to the eventual replacement of many res- 
ident forms by immigrant species”. Given the paucity of palaeontolo- 
gical data from the 22-30-Myr interval in Afro-Arabia, fossils from the 
Rukwa Rift provide a rare window into Palaeogene catarrhine diversity 
during this period of dramatic change in African terrestrial ecosystems, 
with Nsungwepithecus and Rukwapithecus together comprising 40% of 
described late Oligocene anthropoid taxa. The precisely dated stra- 
tigraphy of the site suggests that early hominoid and cercopithecoid 
evolution in eastern Africa took place against the backdrop 
of previously unrecognized tectonic uplift in the western branch of 
the EARS*, coinciding with the global late Oligocene warming event”, 
and pre-dating larger-scale faunal shifts that intensified later in the 
Miocene. 


METHODS SUMMARY 


Rukwa specimens (RRBP 12444A and RRBP 11178) were scanned at the Ohio 
University MicroCT (OULCT) facility in Athens, Ohio, using a GE eXplore Locus 
in vivo small animal MicroCT scanner. The Rukwapithecus type specimen (RRBP 
12444A) was scanned at a slice thickness of 90 Lm, 80 kV, 495 mA yielding a voxel 
size of 0.09 X 0.09 X 0.09mm. For a more detailed reconstruction of occlusal 
surfaces RRBP 12444A was also scanned at a slice thickness of 20 um, 80 kV, 
495 mA. The latter protocol was also used for the Nsungwepithecus type specimen 
(RRBP 11178), yielding a voxel size of 0.02 X 0.02 X 0.02 mm for high-resolution 
scans. The resulting volume data (in VFF-format) were exported from MicroView 
2.2 (open-source software developed by GE; http://www.sourceforge.net) and 
imported into Avizo 6.3 (Visualization Sciences Group) for image segmentation, 
visualization and manipulation. Protocols for phylogenetic analysis and high- 
precision CA-TIMS U-Pb zircon ages for the fossiliferous locality are provided 
in Supplementary Information. 
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Long-term warming restructures Arctic tundra 
without changing net soil carbon storage 


Seeta A. Sistla', John C. Moore?’, Rodney T. Simpson’, Laura Gough’, Gaius R. Shaver? & Joshua P. Schimel! 


High latitudes contain nearly half of global soil carbon, prompting 
interest in understanding how the Arctic terrestrial carbon balance 
will respond to rising temperatures’*. Low temperatures suppress 
the activity of soil biota, retarding decomposition and nitrogen 
release, which limits plant and microbial growth’. Warming ini- 
tially accelerates decomposition**, increasing nitrogen availabil- 
ity, productivity and woody-plant dominance*’. However, these 
responses may be transitory, because coupled abiotic—biotic feed- 
back loops that alter soil-temperature dynamics and change the 
structure and activity of soil communities, can develop*’. Here 
we report the results of a two-decade summer warming experiment 
in an Alaskan tundra ecosystem. Warming increased plant biomass 
and woody dominance, indirectly increased winter soil temper- 
ature, homogenized the soil trophic structure across horizons 
and suppressed surface-soil-decomposer activity, but did not 
change total soil carbon or nitrogen stocks, thereby increasing 
net ecosystem carbon storage. Notably, the strongest effects were 
in the mineral horizon, where warming increased decomposer 
activity and carbon stock: a ‘biotic awakening’ at depth. 

The carbon balance of northern ecosystems will be regulated ulti- 
mately by the difference between carbon gains through greater prim- 
ary productivity and losses through increased soil organic matter 
(SOM) decomposition”’. The direct effects of warming on decompo- 
sers may be either mitigated or amplified by restructuring of the plant 
community, soil food web, and soil temperature regime. However, 
uncertainty regarding the strength of positive versus negative feed- 
backs that couple increased shrub dominance, decomposers and their 
shared soil environment complicates the ability to project future tun- 
dra carbon dynamics”’. 

Warming promotes woody-shrub growth in Arctic systems 
Shrubs trap snow, creating warmer winter soil temperatures that sti- 
mulate decomposers and support shrub dominance further’. Shrubs 
also increase summer shade, thereby reducing decomposer activity by 


11,12 


cooling soils'*. Whether the winter-warming or summer-cooling effects 
will dominate temperature-driven changes in decomposition, or 
whether they will effectively neutralize each other is unknown. Shrub 
expansion also alters both the nature of carbon inputs and where in the 
soil profile they occur. Relatively labile shrub leaf litter'*’> replaces 
slowly decomposing mosses at the soil surface, whereas greater root 
biomass may increase labile carbon inputs deeper in the soil’*’*. Such 
changes in the nature and location of carbon inputs will alter the activity 
and community structure of decomposers further’. Labile, nitrogen- 
rich litter favours a rapidly cycling community dominated by bacteria 
that promotes nitrogen availability’®, whereas lignified woody litter 
selects for a more slowly cycling fungus-dominated food web’. 

As mineral interactions can protect soil carbon physically, control 
over decomposition in mineral soils is different to that in overlying 
organic soils'’. Greater inputs of labile plant-derived carbon into the 
mineral horizon may stimulate decomposer communities'*”’, while 
increasing aggregate formation that can stabilize carbon and limit 
loss”. In parallel with these biotic feedbacks, increased thaw depth 
under warmer conditions will increase SOM availability in the mineral 
horizon, stimulating decomposers”. However, summer shade may ulti- 
mately reduce thaw depth, despite greater annual average soil tem- 
peratures’**', The largest fraction of the permafrost soil carbon is 
contained in the mineral soil’, and therefore these feedbacks can sub- 
stantially alter tundra carbon storage over time. 

We investigated the effects of two decades of summer warming 
on a moist acidic tussock (MAT) tundra ecosystem, to evaluate how 
warming-induced feedbacks affect Arctic carbon cycling. We sampled 
the longest-running whole-system tundra warming experiment: an in 
situ greenhouse experiment that was started in 1989 at the US Arctic 
Long Term Ecological Research site (68° 38’N, 149° 34’W). Twenty 
years of warming increased plant carbon storage and altered both plant 
community structure and the belowground food web; however, there 
was no net change in total soil carbon (P = 0.5) or nitrogen stocks 


Table 1 | Effects of greenhouse warming on tundra carbon and nitrogen pools after 20 years of treatment 


Soil characteristic Surface organic soil Deep organic soil Mineral soil 
Control Greenhouse Control Greenhouse Control Greenhouse 

Carbon (g m~*) 1,026.9 + 101.2 929.1+ 101.8 4,547.2 + 1064.4 3,906.6 + 638.0 6,381.7 + 1411.1 8,342.3 + 786.5 
Nitrogen (g m~?) 26.5+4.1 2/5 23.1 163.5 + 47.2 145.5 + 18.1 318.0 + 75.5 376.9 += 29.3 
Carbon:nitrogen ratio 42.2+1.7 35.8+ 2.6 31.5+14 284+13 20.4+0.4 22.4+ 0.6 
% Carbon 44.6+1.5 38.2 + 2.9 30.44 2.5 28.3 + 2.0 72215 8.7+1.8 
% Nitrogen 1.1+0.04 1.1 +£0.08 0.97 + 0.067 1.1+0.11 0.37 + 0.083 0.39 + 0.076 
Bulk density (gsoil 0.05 + 0.006 0.06 + 0.02 0.14+0.04 0.2 + 0.04 1.0+0.11 0.8+0.1 

3: 
percm*) 
Sampling depth (cm) 5=0 5 +0 10.1 21.5 11.2 1.2 17.0+1.6 15.7 +08 
Percentage soil 443.5+40.5 531.6 + 65.2 39184274 446.1 + 82.3 75.5+149 107.9+ 24.1 


moisture (g H20 perg 
dry soil) 


Numbers represent the mean + 1s.e. Soil pools include surface litter and fine roots. Surface organic soil, less than 5 cm in depth from the surface; deep organic soil, the soil between the surface organic and the 
mineral horizon. The mineral horizon was sampled to permafrost. Means that significantly differ from each other (within horizons, n = 4; «= 0.1) are indicated in bold. Mineral-horizon soil carbon and nitrogen 
stocks were also analysed using a modified equivalent soil-mass method. The maximum sampled control mineral-horizon soil mass among the four replicates was defined as the initial soil mass; all greenhouse 
and control mineral-horizon soil-sample replicates were set as equivalent to that mass and soil carbon stocks were recalculated. The equivalent soil-mass greenhouse mineral-horizon carbon stock was 
approximately 28% greater than control (control average = 7451.0 + 1043, greenhouse average = 9514.8 + 996, P= 0.06). 


Department of Ecology, Evolution and Marine Biology, University of California Santa Barbara, Santa Barbara, California 93108, USA. *Natural Resource Ecology Laboratory, Colorado State University, Fort 
Collins, Colorado 80523, USA. 3Department of Ecosystem Science and Sustainability, Colorado State University, Fort Collins, Colorado 80523, USA. “Department of Biology, University of Texas at Arlington, 
Arlington, Texas 76019, USA. 5The Ecosystems Center, Marine Biological Laboratory, Woods Hole, Massachusetts 02543, USA. 


30 MAY 2013 | VOL 497 | NATURE | 615 


©2013 Macmillan Publishers Limited. All rights reserved 


LETTER 


Table 2 | Effects of warming on plant community 


(a), Effects of greenhouse warming on per cent cover of tundra plants and litter 


Plant Year Treatment Control (1998, 2008) Greenhouse (1998, 2008) 
Betula nana = * 13.5 + 20,1602 23 16.6 +3:3,31.0+68 
Salix pulchra NS NS 3.7 £1.2,38215 5.72 14,53 + 1.6 
Eriophorum vaginatum NS if 10#19,19+25 11.2+28,15+42 

Carex spp. NS vee 3.0 + 0.74, 3.0 + 0.72 0.5 + 0.25, 1.0 + 0.38 
Vacinium vitis-idea NS 14.0 + 1.6,11.3 + 0.28 188+04,9.1 +23 
Rhododendron tomentosum NS NS 8.3 + 1.5,10.6 + 1.6 13.8 + 2.0, 9.8 + 0.86 
Cassiope tetragona * wt 70+0.97,5.7+18 3.4 + 1.2,0.69 + 0.23 
Empetrum nigrum “ NS 45+0.26,3.4+0.6 46+04,23 + 0.58 
Rubus chamaemorus NS * 8.9+2.9,6.8+ 1.6 13.1+1.9,11+1.9 
Pedicularis lapponica NS - 0.63 + 0.26,0.19 + 0.05 0.12 + 0.09, 0.013 + 0.013 
Lichen NS meee 46+1.2,3+0.27 0.075 + 0.052, 0.036 + 0.36 
Moss gi Fee 13.7 = 2.9,7.3 £098 822 1.1,2.7 £0.56 
Litter nee = 0.24+ 014,63 +08 2.6 + 1.0, 8.8 + 0.75 
Deciduous shrub - . 17.2 + 3.0, 20.4 + 3.0 22.4 + 3.3,36.7+7.0 
Evergreen shrub is NS 34.1 + 2.3,31.2 + 4.0 Al? + 3.7,22.1 23.7 
Graminoid - _ 13.4 + 1.1,219+ 23 11.7+29,164+45 

Forb NS NS 10.9 + 3.0,7.8 + 1.6 133 £19,11.2215 


(b), Effects of 14 years of greenhouse warming (2002 harvest) on tundra plant biomass 


Plant biomass (g dry weight per m?) Treatment Control Greenhouse 
Total aboveground (including mosses and lichens) ne 474.9 + 32.6 748.7 + 759 
Vascular aboveground eae 369.5 + 26.0 720.7 + 85.9 
Vascular belowground * 438.3 + 88.7 712.4 + 70.6 
Total aboveground and vascular belowground 7 913,14 1189 1,524.0 + 112.6 
Deciduous shrub * 218.7 + 51.8 551.2+ 119.1 
Evergreen shrub * 324.5 + 20.5 532.2 + 80.8 
Graminoid NS 227.0 + 57.5 179.0 + 92.9 
Forb eae 37.6. =.123 170.7 + 44.4 
Litter and standing dead * 569.3 + 134 758.4 +1714 
Moss ae 75.5 + 10.8 16.3+44 
Lichen * 29.9+64 11.8 + 6.7 


Numbers represent the mean + 1 s.e. of plant percentage cover (sampled in 1998 and 2008), and plant biomass (sampled in 2002). *P< 0.1, **P<0.01, ***P< 0.001 and ****P < 0.0001 for means that differ 


from each other significantly at «= 0.1 (n = 4). 


(P = 0.7; Table 1 and Supplementary Fig. 1). Although warming sig- 
nificantly increased surface litter inputs (Table 2a) and probably also 
increased their quality’®, the greatest biogeochemical changes were at 
depth, in the mineral soil. The greenhouse treatment increased mineral 
soil carbon by 31% (P = 0.06) and its carbon:nitrogen ratio by 9.8% 
(P= 0.05), whereas the surface organic soil carbon:nitrogen ratio 
decreased by 15% (P = 0.02; Table 1). 

The stability of the tundra soil’s overall carbon and nitrogen stocks 
under sustained warming contrasted with the vegetation (Table 2) and 
soil trophic structure (Fig. 1, Table 3 and Supplementary Tables 1-3). 
The greenhouse treatment affected the plant community structure 
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Discriminant function 1 


Figure 1 | A canonical discriminant analysis plot reveals the loss of trophic 
heterogeneity across soil horizons in the greenhouse treatment relative to 
control conditions. CDA biota groupings included total protozoa (amoebae, 
flagellates and ciliates were grouped) and did not include bacteria and fungi 
(which constitute a disproportionate majority of food-web biomass) (j1g carbon 
per g soil; Wilks’ lambda < 0.0001; n = 4; Table 3). 
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across all major MAT growth forms. The percentage cover of the 
deciduous shrub Betula nana increased by 94%, litter cover increased 
by 40%, and lichens and mosses declined by 99% and 63%, respectively. 
After 14 years of warming, greenhouse plant carbon stocks had 
increased by 50% (control = 446.5 +56.8gCm *, greenhouse = 
668.7 + 39.9 gCm >; P=0.02), whereas greenhouse plant nitro- 
gen did not significantly respond (control = 10.1+13gNm °, 
greenhouse = 13.5 +0.84gNm 7; P= 0.12). The greenhouse effect 
on plant biomass was consistent with the percentage cover after 
10and 20 years of warming. The soil food web was also altered by 
the greenhouse treatment. The trophic structure of control MAT soil 
is dominated by fungi and is stratified by depth in biomass, diversity, 
connectance and linkage density, with each horizon possessing a dis- 
tinct structure. The greenhouse treatment reduced the heterogeneity 
of the soil food web among horizons. In the control soil, food webs 
within each horizon differed from each other. Within the greenhouse 
treatment, only the soil communities within the surface organic (0 to 
5.cm in depth) and mineral horizons differed (Fig. 1 and Table 3). 
The surface organic soil carbon and nitrogen stocks were unaffected 
by warming although the carbon:nitrogen ratio declined (Table 1), 


Table 3 | Mahalanobis distances by horizon and treatment 


Horizon or Mahalanobis distances Mahalanobis distances by 
treatment by horizon (within treatment) treatment (within horizon) 
Surface organic Surface organic Deep organic Control versus 
versus deep versus mineral versus mineral greenhouse 
organic 

Control 243.4** 131** 234.1** - 

Greenhouse 74.27 115.4** 44.1 - 

Surface - - - 275.6*** 

organic 

Deep organic - - - 322.2*** 

Mineral - - - 0.1 


CDA units are squared. **P< 0.05, ***P<0.01. 
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Table 4 | Effects of greenhouse warming on tundra microbial biomass, activity and extractable nutrient pools during the growing season 


Soil characteristic Surface organic soil Deep organic soil Mineral soil 

Control Greenhouse Control Greenhouse Control Greenhouse 
Microbial biomass 6,050 + 1237 6,315 + 1366 3,137 + 496 3,810 + 667 151+44 176+31 
carbon (ug C per g soil) 
Microbial biomass 605 + 106 563 +127 285 + 63 307 +90 5.78+1.55 13.0 + 3.86 
nitrogen (ug N per g soil) 
Fungal: bacterial 181 +53 220+ 53 858 + 305 163 + 69 115+37 76+70 
biomass ratio 
SIR active microbial 6,011 + 733 4,584 + 664 1,604 + 434 1,848 + 351 59.9 + 7.26 83.0 + 4.28 
biomass (ug C-COz per g 
soil per day) 
Carbon mineralization 31.83 + 1.6 2715+ 11.6 12.37 + 2.2 15.46+3.4 0.43 + 0.05 0.64 + 0.04 
(14-day incubation; pg 
C-COz per g soil per day) 
Extractable organic 3.93 + 0.6 543413 25.16+4.6 27.95+88 84.31 + 10.2 71.71 +98 
nitrogen (mg m~2)* 
Extractable organic 111.07 +183 138.11 437.1 610.94 + 192.3 917.60 +572 1,343.14 + 160.2 1,092.11 + 97.1 


carbon (mgm °)* 


* Scaling to m~* uses average plot bulk density and volume. Numbers representthe mean + 1s.e. of soil microbial biomass, activity and extractable nutrient pools. Means that differ from each other significantly at 


a= 0.1 (within horizons; n = 4) are indicated in bold. C-COz, carbon derived from or originating from COz. 


reflecting a decline in carbon-rich mosses and lichens (Table 2) and 
increased inputs of leaf litter from B. nana and Rubus chamaemorus 
with lower carbon:nitrogen ratios”. We observed significant reduc- 
tions in the greenhouse surface organic soil’s root-feeding (P = 0.07) 
and predator nematode biomass (P = 0.07), as well as active microbial 
biomass (measured by substrate induced respiration (SIR); Table 4). 
This re-alignment of the trophic pathways and overall suppression of 
the surface organic decomposer food web activity highlights that 
changes in plant litter inputs alone do not account for soil-community 
responses to warming. It also challenges our hypothesis that soil nitro- 
gen enrichment coupled with a decline in relatively recalcitrant matter 
correlates with more rapid carbon-cycling in the organic soil horizons. 
In the deep organic horizon (5 cm below the surface to mineral soil 
horizon), both bacterivores (P = 0.09) and the ratio of fungal to bac- 
terial biomass (P = 0.08) declined, indicating that warming restricted 
both the root- and detritus-fed trophic pathways. 

The altered soil-community structure and activity in the organic 
horizons may be driven in part by shifted greenhouse soil-temperature 
patterns, which are consistent with the snow-trap warming model of 
tundra shrub expansion (Fig. 2 and Supplementary Table 4). The 
greenhouse treatment initially warmed the soil so that its temperature 
was up to 2 °C higher than ambient temperature during the summer”. 
Although the summertime greenhouse soil warming declined over the 
course of the experiment”, by 1996 a winter warming effect had 
developed in the mineral soil that consistently exceeded the direct 
effects on summer soil temperature (Supplementary Table 4). We 
propose that at the onset of the experiment, soil respiration increased, 
similar to an increase in soil respiration that was observed 3.5 years into 
another warming experiment nearby” and in meta-analyses of short- 
term tundra responses to warming’. Declining summertime organic 
soil respiration (the dominant period of tundra soil activity™*), coupled 
with greater litter inputs, may have offset initial warming-driven car- 
bon losses, causing no net change in organic horizon carbon stocks. 

Unlike the overlying organic horizons, the greenhouse treatment 
stimulated mineral soil-decomposer activity; microbial biomass nitro- 
gen more than doubled, SIR biomass increased by 39%, and incubation- 
based carbon mineralization was approximately 1.5 times greater than 
ambient levels, whereas extractable organic carbon was reduced by 19% 
(Table 4). This may reflect increased labile carbon consumption by the 
more active decomposer community. The greenhouse food web became 
more homogenized across depth relative to control conditions, owing to 
a vertical redistribution of functional groups, including the presence of 
phytophagous nematodes and oribatid mites (not found in the mineral 
soil under control conditions), and the loss of non-oribatid mites, in the 
greenhouse mineral soil (Fig. 1, Table 3, and Supplementary Tables 1 
and 2). Thus, although the detritus-fed pathway underwent a transition, 
the root-derived trophic pathway expanded. 


The stimulation of the greenhouse mineral soil food web is probably 
caused by increases in late August-thaw depth (Supplementary Fig. 2), 
in annual soil temperature and in the number of plant-derived 
resources at depth. Root-derived carbon is a primary resource for soil 
food webs” but roots (which in MAT extend into the mineral soil!*) 
also contribute substantially to stable SOM pools”. Increases in the 
volume of litter that moves deeper owing to cryoturbation’, and greater 
fine-root biomass at the interface of the organic and mineral horizons 
may have promoted the changes observed in the greenhouse mineral soil. 
Although we did not measure fine-root biomass in 2008, belowground 
vascular biomass had increased by 2002 (Table 2b), and B. nana root 
production is stimulated by warm conditions’. Furthermore, B. nana 
dominance increases overall rooting in the deep organic horizon’*. We 
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Figure 2 | A discontinuous record of the difference between greenhouse 
treatment and control soil. Temperature probes up to approximately 10, 

20 and 40 cm in depth, from 1991 to 2008. Monthly averages of daily 
temperatures were used in each block. Vertical lines indicate when the 
greenhouse is actively warming. Only in 1991 and 1996 are summer greenhouse 
soil temperatures are significantly higher than control temperatures (P < 0.1; 
n = 1). Winter (January to March) temperatures do not initially differ between 
greenhouse and control; however, from 1996 to 2008, winter greenhouse soil 
temperatures exceeded ambient conditions in all horizons and exceeded the 
observed summer temperature differences (P < 0.0001; Supplementary Table 4). 
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propose that warming stimulated greater rhizome and root biomass in 
the greenhouses’ deep organic and mineral soils, which increased the 
carbon-rich root exudates and leachates moving into the mineral horizon 
in these highly water-saturated soils. 

Long-term tundra warming increased plant biomass carbon and the 
dominance of woody plants, altered the structure and activity of the 
soil food web, and changed both the vertical distribution and turnover 
of soil carbon without affecting overall soil carbon storage. We infer 
from this that 20 years of warming has promoted net ecosystem carbon 
storage. Our results also show that deeper mineral soils are susceptible 
to coupled biotic-abiotic effects driven by warming over decades. 
Although increased decomposer activity did not offset increased car- 
bon inputs in the mineral soil, incubation studies suggest that labile 
carbon limits tundra mineral-soil-decomposer activity”. Thus, 
although greater carbon availability at depth may initially increase 
carbon storage, it remains uncertain whether the ecosystem response 
observed after 20 years of warming reflects a continued trajectory of 
increased net carbon storage or a transient state in which an activated 
decomposer system will ultimately outpace carbon inputs. As such, 
identifying the mechanisms under which warming stimulates and 
regulates tundra decomposer activity at depth—where the majority 
of permafrost soil carbon is stored—remains a pressing challenge. 


METHODS SUMMARY 

Site description. The study was established in 1989 in MAT near Toolik Lake, 
Alaska, the United States (68°38'N, 149°34’W), and is described in detail 
elsewhere'>*>, MAT vegetation is similar to approximately 0.9 X 10°km* of 
tussock tundra found globally”. 

Experimental design. The study consists of four blocks containing control and 
greenhouse plots. The greenhouses are made of wooden frames (2.5 X 5 X 1.5m) 
and 0.15-mm polyethylene sheets, and are erected yearly, immediately after snow- 
melt. In July 2008, we sampled four randomly selected subplots per plot (” = 4)’. 
Green litter was removed, and organic monoliths (8 X 8 cm) were collected and 
separated into surface organic (0-5 cm), deep organic (5 cm to mineral horizon) 
and mineral soil (sampled to the permafrost). Plant, soil and fauna traits were 
measured using standard methods****. 

Statistical analyses. Data were tested for normality. A blocked analysis of variance 
(ANOVA), repeated measures ANOVA followed by a Holm-Bonferroni correc- 
tion for multiple comparisons across years, or a Wilcoxon signed-rank test was 
used to compare means with a two-tailed t-test at « <0.1; the fixed effects were 
treatment, year and their interaction (as appropriate), separated by horizon. Plant 
cover data were arcsine-square-root transformed and plant biomass data were 
natural-log-transformed. Food web data were log-transformed and analysed using 
ANOVA (comparing horizons across treatment and testing for a depth effect 
within treatment, and then using Tukey’s honestly significant difference (HSD) 
for multiple comparisons) and canonical discriminant analysis (CDA). Food-web 
biomass estimates were the CDA interval variables; treatment and soil horizon 
combinations constituted the classification variables. The model was run with 
bacteria and fungi, with similar effects. A soil food web connectedness description 
based on functional groups was used to compare horizons within and across 
treatments. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Experimental treatment. The greenhouse treatment reduces photosynthetically 
active radiation and direct input from precipitation, but does not negatively influ- 
ence plant growth or affect soil moisture (as far as it was possible to detect this), 
and uneven microtopography enables air circulation beneath the greenhouse 
bases'*”’. The greenhouse air temperature is elevated by 2 °C on average, and soil 
temperature increased by more than 1 °C on average (up to 40 cm in depth)”. Soil 
temperature is measured at a depth of approximately 10, 20 and 40cm using a 
Campbell CR21x data logger that recorded two profiles of soil temperature per 
treatment in one block with copper or constantin thermocouple wires. Soil-tem- 
perature data are patchy because of field-equipment failures and irreconcilable 
movement of the probes with freeze-thaw cycles. The probability of making a type 
Terror (& = 0.1) was chosen to balance the potential for making type II versus type 
I errors, given the study’s small replicate number (n = 4). 

Soil harvest and analyses of soil carbon and nitrogen pools. Live coarse roots 
were removed from soil subsamples by hand, and the volume and mass of rocks 
were determined and subtracted from the volume and mass of soil in the labor- 
atory. The remaining soil was subsampled for gravimetric water content, and 
carbon and nitrogen concentrations. Soil subsamples were dried to a constant 
mass and carbon and nitrogen concentrations were determined using a Fisons 
NA1500 carbon and nitrogen analyser. The mass of carbon and nitrogen in each 
horizon (g carbon per m? and g nitrogen per m*) was calculated by multiplying the 
percentage of carbon or nitrogen per subsample with the corresponding quadrat 
subsample bulk density and volume, and dividing this value by the subsample area. 
Treatment effects on the percentage of carbon or nitrogen in soil, and on the mass 
of carbon and nitrogen in horizons were analysed at the block level using the 
average of the four block subsamples. Subsamples from the four individual quad- 
rats, separated by horizon, were composited (aggregated) at the block level by hand 
for further biogeochemical analyses at the block level’*. 

Vegetative cover and biomass. Vegetative cover was quantified visually in eight 
permanent1-m” quadrats per plot. A plant harvest in July 2002 used four subplots 
per block, with vegetation (excluding roots) cut through the mineral-soil surface 
(including rhizomes that extended into the mineral horizon)’. 

Carbon mineralization and SIR microbial biomass. Composited soils (approxi- 
mately 8g wet weight for the surface organic soil and approximately 10g wet 
weight for the deep organic and mineral horizons) were weighed out into airtight 
incubation vessels and incubated at approximately 5°C (similar to in situ soil 
conditions) for 14 days to capture carbon-mineralization rates immediately after 
soil harvest. Cumulative carbon mineralization (CO, accumulation) was mea- 
sured by drawing headspace air samples from the incubation vessels using airtight 
glass syringes and injecting each sample into an infrared gas analyser (IRGA; LI- 
COR 6252). The carbon-mineralization rate was calculated as the change in head- 
space CO concentrations (in g C-CO, (carbon originating or derived from 
CO,)) per g soil (dry weight equivalent) per day over the 14-day incubation period. 
Microbial biomass at the end of the carbon-mineralization incubation period was 
estimated by SIR***?. Samples were amended with autolysed yeast extract (a 10-ml 
solution of 3 g yeast extract dissolved into 250 ml milli-Q HO) in airtight tubes 
and placed horizontally on a shaker (100 r.p.m.) for 6h. Headspace CO, concen- 
trations were monitored after 3 and 6h. The average respiration rate (tug C-CO2 
perg soil perh) over the incubation period is an index of the SIR-responsive 
microbial biomass. 

Extractable microbial-biomass carbon and nitrogen. Chloroform-extractable 
microbial biomass was determined as described elsewhere” on approximately 8 g 
wet weight of surface organic and approximately 10 g wet weight of deep organic 
and mineral composited soil, immediately after soil collection. Microbial-biomass 
carbon or nitrogen (extractable plus non-extractable biomass) was calculated by 
dividing the measured extractable biomass values by correction factors of 0.45 for 
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carbon” and 0.54 for nitrogen™. Extractable inorganic nitrogen was analysed 
using the 0.5M K;SO, extracts on a Lachat autoanalyzer (NH, using Lachat 
method no. 31-107-06-5-A; NO; using Lachat method no. 12-107-04-1-B). 

Direct counts of bacteria and fungi. Bacterial and fungal biomass were estimated 
using a direct count method*®”’. All finished samples were stored at 4°C until 
direct counts could be made. Bacterial direct counts were made using a confocal 
microscope at X 1,500 magnification. Biomass conversions were made assuming 
an average dry weight of 6.65 X 10° g per bacterial cell*°. Fungal hyphal length 
was estimated using a confocal microscope at X 400 magnification by counting the 
number of times hyphae crossed an ocular lens grid. Hyphal length was estimated 


NA 
using the equation R= OH where R is the total hyphal length, N is the number of 


times hyphae crossed the horizontal lines on the grid, A is the area of one slide well, 
and H is the total length of the horizontal grid lines. Biomass conversions were 
made assuming 2.3 X 10° gm! of hyphae. 

Protozoan biomass. Protozoan densities were estimated using a most-probable- 
number (MPN) technique”, with a starting serial dilution of 10 g of sieved soil in 
90 ml of sterile de-ionized water, with tenfold dilutions to 10~° ml. Four 0.5-ml 
subsamples were taken from each dilution in the series and individually trans- 
ferred to a well in a standard 24-well plate. A 50-11 suspension of the bacterium 
Escherichia coli was transferred to each well. Plates were incubated at 26 °C for 
3 days, after which individual wells were observed using an inverted compound 
microscope at X 100 magnification. Wells were scored for the presence or absence 
of ‘ciliates’, ‘flagellates’ and ‘amoebae’. The densities of total protozoa, and the 
individual subgroupings were estimated from the frequency distribution of pos- 
itive reads across the dilution series using the MPN estimation program provided 
by the US Environmental Protection Agency. 

Invertebrate biomass. Soil fauna were isolated from fresh soils and enumerated as 
described previously**. Soil nematodes were isolated using Baermann funnels. 
Microfauna were heat extracted into a solution of 90% ethanol and 1% glycerine 
from soils using Tullgren funnels. The intact samples were wrapped individually in 
cheesecloth, weighed, placed over a funnel and heated with a 9-W incandescent 
light for 5 days. The dried soils were re-weighed, and the arthropods were sorted 
into taxonomic groupings and functional groups. Enchytraeids, ciliates, rotifers, 
tardigrades, nematodes and microarthropods (collembola, mites, copepods and 
insects) were counted from 5-g subsamples of sieved soil immersed in de-ionized 
water using a dissecting microscope. 
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Synthetic analog computation in living cells 


Ramiz Daniel':**, Jacob R. Rubens”*"*, Rahul Sarpeshkar!?:*>°7* & Timothy K. Lu234:6-8# 


A central goal of synthetic biology is to achieve multi-signal integ- 
ration and processing in living cells for diagnostic, therapeutic and 
biotechnology applications’. Digital logic has been used to build 
small-scale circuits, but other frameworks may be needed for effi- 
cient computation in the resource-limited environments of cells~’. 
Here we demonstrate that synthetic analog gene circuits can be engi- 
neered to execute sophisticated computational functions in living 
cells using just three transcription factors. Such synthetic analog 
gene circuits exploit feedback to implement logarithmically linear 
sensing, addition, ratiometric and power-law computations. The 
circuits exhibit Weber’s law behaviour as in natural biological sys- 
tems‘, operate over a wide dynamic range of up to four orders of 
magnitude and can be designed to have tunable transfer functions. 
Our circuits can be composed to implement higher-order functions 
that are well described by both intricate biochemical models and 
simple mathematical functions. By exploiting analog building-block 
functions that are already naturally present in cells*”, this approach 
efficiently implements arithmetic operations and complex functions 
in the logarithmic domain. Such circuits may lead to new applica- 
tions for synthetic biology and biotechnology that require complex 
computations with limited parts, need wide-dynamic-range biosen- 
sing or would benefit from the fine control of gene expression. 

In natural biological systems, digital behaviour is appropriate in 
settings where decision making is necessary, such as in developmental 
circuits’. The digital approach is an abstraction of graded analog func- 
tions, where values above a threshold are classified as ‘1’ and values 
below this threshold are classified as ‘0’ (Fig. 1a). Digital computa- 
tion using synthetic gene circuits has included switches, counters, 
logic gates, classifiers and edge detectors (see references 28-40 in 
Supplementary Information). However, given that there is often 
unwanted crosstalk amongst synthetic devices’ and cellular resource 
limitations*’, it may be challenging to scale digital logic functions to 
the level needed for complex computations in living cells. Analog 
functions can be found in natural biological systems, where they enable 
graded responses to environmental signals’*. For example, neurons 
can implement both digital and analog computation’. Furthermore, 
electronic circuits that perform analog computation on logarithmically 
transformed signals have been used in commercially valuable electronic 
chips for several decades. The thermodynamic Boltzmann exponential 
equations that describe electron flow in electronic transistors are strik- 
ingly mathematically similar to the thermodynamic Boltzmann expo- 
nential equations that describe molecular flux in chemical reactions’. 
Their similarities suggest that log-domain analog computation in elec- 
tronics may be mapped to log-domain analog computation in chem- 
istry, and vice versa’. Because analog computation exploits powerful 
biochemical mathematical basis functions*"® that are naturally present 
over the entire continuous range of input operation (Fig. la), they are an 
advantageous alternative to digital logic when resources of device count, 
space, time or energy are constrained”, 


We first created an analog synthetic gene circuit motif that performs 
positive, wide-dynamic-range logarithmic transduction of inducer con- 
centration inputs to fluorescent protein outputs (Fig. 1b). The resulting 
transfer functions have a broad region of linearity on a semi-log plot 
(log-linear). Logarithmic functions can enable intensity-independent 
responses and can compress a large input dynamic range into a smaller, 
manageable output dynamic range. A logarithmic function naturally 
implements Weber’s law behaviour, which states that the ratio between 
the perceptual change in a signal divided by its background level is a 
constant, resulting in the detection of fold changes rather than absolute 
levels*. Weber’s law is approximately true within molecular signalling 
networks and in describing human perception of sound intensity, light 
intensity and weight*. 

Our wide-dynamic-range, positive-logarithm circuit consists of a 
positive-feedback component located on a low-copy-number plasmid 
(LCP) and a ‘shunt’ component located on a high-copy-number plas- 
mid (HCP). The LCP comprises promoters which generate transcrip- 
tion factors that bind to the promoters, thus increasing the generation 
of transcription factors by means of a positive-feedback loop and 
alleviating saturation of the interaction between inducers and trans- 
cription factors. The HCP comprises promoters that bind and shunt 
away the generated transcription factors, effectively redirecting some 
of them away from the LCP and reducing saturation of the interac- 
tion between transcription factors and DNA. The HCP shunt also 
affects the effective strength of the LCP positive-feedback loop. The 
HCP generates a reporter or protein output that is different from the 
transcription factor output (Fig. 1b). We initially applied this 
‘positive-feedback/shunt’ (PFS) circuit motif to the AraC transcrip- 
tional activator, which is induced by arabinose and activates the Pgap 
promoter (Fig. 1b and Supplementary Fig. 6). 

The AraC-based PFS circuit had an input-output transfer function 
that was well fitted by a simple mathematical function of the form 
In(1 + x), which is a first-order approximation of the Hill function for 
small values of x, where x is a scaled version of the input concentration 
(Fig. 1d and Supplementary Information, section 4). Furthermore, this 
circuit had a wide input dynamic range of more than three orders of 
magnitude, where the dynamic range is taken to be the span of inputs over 
which the output is well fitted by In(x) (Fig. 1d and Supplementary Fig. 
18). The simple logarithmic mathematical functions that describe our 
wide-dynamic-range circuits are useful for designing higher-order func- 
tions. The wide-dynamic-range behaviour of our circuit was especially 
striking when compared with the narrow dynamic range of the open-loop 
control, which has a shunt motif but no positive-feedback motif. This 
‘open-loop/shunt’ motif is shown in Fig. 1c, d and in Supplementary 
Fig. 16. When the shunt plasmid in the PFS motif contains a P),,,, promoter 
rather than a Pgap promoter, wide-dynamic-range logarithmic operation 
for the AraC-based circuit is also absent (Supplementary Fig. 17). These 
controls indicate the importance of graded positive feedback, implemen- 
ted here with the PFS motif, in achieving wide-dynamic-range operation. 


lAnalog Circuits and Biological Systems Group, Research Lab of Electronics, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA. *Synthetic Biology Group, Research Lab of 
Electronics, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA. ?Synthetic Biology Center, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA. 
4MIT Microbiology Program, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA. °Department of Electrical Engineering & Computer Science, Massachusetts Institute of 
Technology, Cambridge, Massachusetts 02139, USA. °MIT Computational and Systems Biology Program, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA. ’MIT Biophysics 
Program, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA. 8Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 
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Figure 1 | Positive-feedback linearization of gene circuits for wide- 
dynamic-range analog computation. a, Synthetic analog gene circuits use the 
inherent continuous behaviour of biochemical reactions to perform 
computations and implement mathematical functions over a wide dynamic 
range, whereas digital circuits abstract this behaviour into discrete ‘0’s and ‘I’s. 
b, The AraC-based, wide-dynamic-range, positive-logarithm circuit 
logarithmically transforms input inducer concentrations into output protein 
levels. This topology involves a transcriptional positive-feedback (PF) loop on 
an LCP, which alleviates saturated binding of inducer to transcription factor, 
along with a shunt HCP containing transcription-factor-binding sites, which 
alleviates saturation of DNA-binding sites. Arab, arabinose. c, The open-loop 
(OL) control consists of AraC-GFP expression from a Pja¢o promoter with no 
positive feedback. d, Arabinose-to-mCherry transfer function. The positive- 


To gain deeper insights into the mechanisms that may give rise to 
logarithmically linear transfer functions, we built detailed bioche- 
mical models that capture the effects of inducer/transcription-factor 
binding, transcription-factor/DNA binding, the PFS circuit topology 
and protein degradation (Fig. le and Supplementary Information, 
section 2). Using a consistent set of model parameters that differ only 
on the basis of the various circuit topologies (for example in plasmid 
copy number), our biochemical models accurately capture the beha- 
viour of our multiple circuits (Figs 1 and 2 and Supplementary 
Fig. 31). A minimal biochemical model, which incorporates only the basic 
effects of graded positive feedback, also shows linearization (Supplemen- 
tary Fig. 30). Indeed, our scheme for widening the log-linear dynamic 
range of operation by means of graded positive feedback is conceptually 
general and applies to both genetic and electronic circuits: expansive 
hyperbolic-sine-based linearization of compressive hyperbolic-tangent- 
based functions in log-domain electronic circuits'' is analogous to the 
use of expansive positive-feedback linearization of compressive bioche- 
mical binding functions in log-domain genetic circuits. 

We generalized these findings to the quorum-sensing transcriptional 
activator LuxR, which is induced by acyl homoserine lactone (AHL) and 
activates the promoter P),,. (Fig. 2a). As fluorescent outputs of this cir- 
cuit, we fused green fluorescent protein (GFP) to the carboxy terminus of 
LuxR and used an HCP P;,,.-mCherry shunt. The LuxR PFS circuit also 
had an input dynamic range of more than three orders of magnitude 
(Fig. 2b) and performed robustly over multiple time points (Supplemen- 
tary Fig. 26). This input dynamic range was significantly greater than 
that achieved with LuxR-GFP positive feedback alone or with LuxR- 
GFP positive feedback with a medium-copy-number plasmid (MCP) 
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feedback LCP with a shunt HCP (black triangles) implements a wide-dynamic- 
range, positive-slope logarithm circuit with an input dynamic range of more 
than three orders of magnitude. It is well fitted by a mathematical function of 
the form In(1 + x), where x is a scaled version of the input inducer 
concentration. In contrast, the open-loop LCP with a shunt HCP (orange 
squares) has a narrow dynamic range and is well fitted by a Hill function. a.u., 
arbitrary units. e, Comparing the positive-feedback LCP with a shunt MCP (red 
diamonds) and the positive-feedback LCP with a shunt HCP (black triangles; 
data from d) demonstrates the importance of the shunt plasmid in making 
possible the wide-dynamic-range operation. Solid lines indicate modelling 
results of the detailed biochemical model (Supplementary Information). The 
errors (s.e.m.) shown in the figures are derived from three flow cytometry 
experiments, each of which involved n = 50,000 events. 


shunt (Fig. 2b). The output of the shunt plasmid (mCherry) had similar 
properties and thus can also be used for computation (Fig. 2c). As in 
the AraC-based circuits (Fig. 1), both detailed biochemical models 
(Fig. 2b, c), where the only varying parameter was the plasmid copy 
number, and the simple In(1 + x) mathematical function (Fig. 2b, ¢, 
insets) captured the behaviour of the LuxR-based circuits. 

The behaviour of the PFS circuit motif can be dynamically tuned by 
changing the relative copy numbers of the positive-feedback and shunt 
plasmids. We demonstrated such tuning by combining an HCP shunt 
with a variable-copy-number plasmid (VCP), based on a pBAC/oriV 
vector’’, carrying the positive-feedback component (Fig. 2d). When 
the VCP was induced to a high-copy-number state, the circuit had a 
narrow dynamic range of about two orders of magnitude and was 
poorly fitted by a In(1 + x) function but could be fitted by a “digital- 
like’ Hill function (Fig. 2e). When the VCP was in a low-copy-number 
state, the circuit behaved in an analog fashion, followed a In(1 + x) 
mathematical relationship and had a broad dynamic range of nearly 
four orders of magnitude. Such tuning demonstrates the importance of 
the relative copy numbers of the positive-feedback and shunt plasmids 
in enabling wide-dynamic-range logarithmic operation. It also pro- 
vides a mechanism for actively changing circuit behaviour between 
analog and digital modes and shows that the PFS circuit motif can be 
reliably used in different Escherichia coli strain backgrounds. 

Our analog computation modules can be composed into more 
complex circuits for higher-order functions, as shown in Supplemen- 
tary Information, section 10, and Supplementary Fig. 31. There we 
show how to compose a wide-dynamic-range PFS logarithm motif 
with a Lacl repressor module to create a wide-dynamic-range negative 
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Figure 2 | Analog behaviour versus digital. a, The LuxR-based, wide- 
dynamic-range, positive-logarithm circuit. b, The AHL-to-GFP transfer 
function for a positive-feedback LCP (blue circles), a positive-feedback LCP 
with a shunt MCP (red diamonds) and a positive-feedback LCP with a shunt 
HCP (black triangles). The positive-feedback LCP with a shunt HCP 
implements a wide-dynamic-range, positive-slope logarithm circuit. Solid lines 
indicate modelling results of the detailed biochemical model; the inset shows 
the fit of a mathematical function of the form In(1 + x). c, The AHL-to- 
mCherry transfer function for a positive-feedback LCP with a shunt MCP (red 
diamonds) and a positive-feedback LCP with a shunt HCP (black triangles). 
The positive-feedback LCP with a shunt HCP implements a wide-dynamic- 
range, positive-slope logarithm circuit. Solid lines indicate modelling results of 
the detailed biochemical model; the inset shows the fit of a mathematical 
function of the form In(1 + x). d, Placing the positive-feedback loop on a VCP 


logarithm motif of the form —In(1 + x). The latter motif functions 
over four orders of magnitude and is well predicted by detailed bio- 
chemical models and by compositions of empirical fits. 

As an application of synthetic analog genetic circuits, we created an 
adder circuit that sums its protein inputs, which are logarithmically 
transformed representations of inducer molecules (Fig. 3a). Such sum- 
mation is achieved by combining two parallel, wide-dynamic-range, 
positive-logarithm circuits that accept different input molecules (for 
example AHL and arabinose) but produce a common output molecule 
(Fig. 3a). The adder exhibited log-linear behaviour over a range of two 
orders of magnitude (Fig. 3b and Supplementary Fig. 21). Because log- 
linear addition of two inputs effectively results in the logarithm of their 
product, an analog product is equivalent to a ‘soft AND’ operation, 
that is, to a graded multiplication rather than to a one-bit digital 
multiplication. Thus, the data of Fig. 3b have similarities to that 
obtained from digital AND circuits, except that the overall function 
is more graded in nature. 

We also developed a PFS circuit that calculates the log-transformed 
ratio of two different input inducers (Fig. 3c). Ratiometric calculations 
are useful in biological systems, because they enable the normalization 
of measurements, comparisons between variables and decisions based 
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allows the dynamic adjustment of AHL-to-mCherry transfer functions 
between analog and digital behaviours using a CopyControl (CC) induction 
solution. The VCP is normally maintained at low copy numbers and can be 
induced to higher copy numbers through CopyControl-mediated expression of 
replication protein TrfA from a promoter integrated into the genome of EPI300 
cells'*. e, At high copy numbers (CC ON, red diamonds), the circuit behaves in 
a digital-like fashion. The dotted red line is a Hill function fit to the digital-like 
curve. The dashed black line reveals that the digital-like curve is not well fitted 
by aln(1 + x) function. At low copy numbers (CC OFF, blue circles), the circuit 
behaves in an analog fashion with a wide dynamic range. The dashed blue line 
indicates that the PFS positive logarithm is well fitted by a In(1 + x) function. 
The errors (s.e.m.) in the figures are derived from three flow cytometry 
experiments, each involving n = 50,000 events. 


on competing inputs. Our ratiometer circuit was built by combining a 
wide-dynamic-range, negative-logarithm circuit and a wide-dynamic- 
range, positive-logarithm circuit that accept different input molecules 
but produce a common output molecule (Fig. 3d). This circuit essen- 
tially calculates the difference between the log-transformed outputs of 
the two inputs (subtraction in the log domain). By tuning the ribo- 
some-binding sequences of the negative- and positive-logarithm cir- 
cuits such that the magnitudes of their slopes are similar, the resulting 
mathematical function is the log-transformed ratio of the two inputs 
and functions over four orders of magnitude of this ratio. Our wide- 
dynamic-range ratiometer enables the concept of pH, which measures 
the logarithmic concentration ratio of H* with respect to an absolute 
value, to be generalized to the concept of ‘pRATIO’, which may be 
useful for measuring the logarithmic concentration ratio of one input 
with respect to another input. 

In addition to logarithmically linear addition and ratiometric calcu- 
lations, synthetic analog circuits in living cells can implement power 
laws. To implement such laws, we introduced an inducer/transcription- 
factor binding function into a strong negative-feedback loop that 
includes two stages of amplification (Fig. 3e). In the first stage, araC- 
GFP produced from PlacO on a LCP activates lacI-mCherry produced 
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Figure 3 | Log-domain analog computation. Complex analog computation 
can be implemented by composing synthetic gene circuits. a, An adder is built 
by engineering two wide-dynamic-range, positive-slope logarithm circuits 
(modules outlined in red) to produce a common output, which is summed to 
yield the overall output. b, The adder circuit sums the logarithms of two inputs, 
AHL and arabinose, over about two orders of magnitude, to produce an output, 
mCherry. ¢, A division circuit, or ratiometer, is implemented when the slopes of 
the wide-dynamic-range, positive- and negative-logarithms (modules outlined 
in red) are closely matched by tuning their output ribosome-binding sites 
(RBSs). IPTG, isopropyl-B-p-1-thiogalactopyranoside. d, The ratiometer 
performs a logarithmic transformation on the ratio of two inputs, arabinose 


from PBAD on a HCP. In the second stage, LacI-mCherry represses 
araC-GFP production at PlacO, completing the negative-feedback loop. 
The power-law nature of our circuit arises from the interactions of 
saturated-repressor polynomial functions and a linear activator poly- 
nomial function in a feedback loop (Supplementary Information, sec- 
tion 2). The power-law behaviour of our circuit extended over two 
orders of magnitude, was accurately predicted by our detailed bioche- 
mical models and was well matched by a simple power law, x” (Fig. 3f). 
We have shown that powerful wide-dynamic-range analog com- 
putations can be performed with just a few biological parts in living 
cells. Four-input-bit, two-output-bit square roots can be calculated 
in vitro using 130 DNA strands within a DNA-based computation 
framework’’. In comparison, our in vivo, analog power-law circuit 
exploits binding functions that are already present in the biochemistry 
and therefore requires only two transcription factors. Even one-bit full 
adders and subtractors in digital computation require several logic 
gates and, thus, numerous synthetic parts (refs 34, 35 and 37 Sup- 
plementary Information). Our wide-dynamic-range analog adders 
and ratiometers are inherently implemented by circuits that add flux 
to or subtract flux from a common output molecule, and can be con- 
structed with no more than three transcription factors (Fig. 3). 
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and AHL, over more than three orders of magnitude. IPTG was held constant at 
1.5 mM. The dotted blue line indicates a log-linear fit. e, A negative-feedback 
loop with tunable feedback strength implements power-law functions. This 
circuit motif uses LacI-mCherry produced on an HCP to suppress the 
production of AraC-GFP on an LCP. When induced by arabinose, AraC-GFP 
enhances the production of LacI-mCherry. f, Power-law behaviour can be 
observed in the IPTG-to-mCherry transfer function, with arabinose constant at 
4.6 mM. The solid line indicates modelling results of the detailed biochemical 
model; the inset shows the fit to a power law of the form x°” The errors (s.e.m.) 
shown in the figures are derived from three flow cytometry experiments, each of 
which involved n = 50,000 events. 


In Supplementary Information, section 11, we outline potential 
applications of our efficient circuit motifs in biotechnology and medi- 
cine for fine control of gene expression or wide-dynamic-range sensing. 
Our wide-dynamic-range analog computation circuits can be further 
integrated with dynamical systems such as oscillators'*’°, negative- 
feedback linearizing circuits'”’*, endogenous circuits’, cell-cell com- 
munication” ” and cellular computers”’, and can be implemented 
using RNA components™*”®, synthetic transcriptional regulation” or 
protein-protein interactions”. 

Using fundamental properties of the scaling laws of thermodynamic 
noise with temperature and molecular count, which hold in both 
biological and electronic systems, the advantages and disadvantages 
of analog computation versus digital have been analysed for neurobio- 
logical systems’ and for systems in cell biology’. These results show 
that analog computation is more efficient than digital computation in 
part count, speed and energy consumption below a certain crossover 
computational precision. Although the exact crossover precision var- 
ies with the computation, both in electronics and in actual biological 
cells, the use of feedback loops, calibration loops, technological basis 
functions, redundancy, signal averaging and error-correcting topolo- 
gies can push this crossover precision to higher values’. Alternatively, 
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for a given speed of operation, more energy must be expended in 
creating a higher molecular production rate, which leads to a higher 
molecular count and, thus, higher precision*’. Trade-offs between 
error and resource use are therefore inherent in the design of synthetic 
circuits in living cells. To demonstrate the tunability of our analog 
circuits, we constructed an AraC PFS circuit with two Pg,4p promoters 
on the shunt plasmid, leading to an increase in the log-linear gain of 
about twofold relative to the circuit’s single-Pg4p counterpart 
(Supplementary Fig. 25). We have also analysed the sensitivities of 
our circuits, defined as the fractional change in the output divided 
by the fractional change in the input, and found that they compare 
favourably to circuits operating with positive feedback only or in open- 
loop configurations (Supplementary Fig. 27). 

We propose that an efficient and accurate computational approach 
to synthetic biological networks will ultimately integrate both analog 
and digital processing (a simple example of switched analog computa- 
tion is shown in Supplementary Fig. 24). This mixed-signal approach 
can utilize analog or collective analog computation® (the latter with 
many interacting moderate-precision analog circuits) for front-end 
processing in concert with decision-making digital circuits; or it may 
use graded feedback for simultaneous analog and digital computation, 
as in neuronal networks in the brain’. Thus, future efforts can seek 
to integrate synthetic analog and digital computation in living cells to 
achieve enhanced computational power, efficiency, reliability and 
memory. Such mixed-signal processing would benefit from the deve- 
lopment of circuits to convert signals from analog to digital, and vice 
versa’”°. These complementary approaches promise to broaden the com- 
putational power of synthetic biology and the range of applications that 
can be addressed using techniques from this emerging engineering field. 


METHODS SUMMARY 


Strains and plasmids. All plasmids in this work were constructed using basic 
molecular cloning techniques (Supplementary Information). Escherichia coli 10B 
(araD139 A(ara-leu)7697 fhuA lacX74 galK (80 A(lacZ)M15) mcrA galU recA1 
endAl nupG rpsL (StrR) A(mrr-hsdRMS-mcrBC)) or E. coli EP1300 (F- mcrA 
A(mrr-hsdRMS-mcrBC) ®80dlacZAM15 AlacX74 recAl endA1 araD139 A(ara, 
leu)7697 galU galK (- rpsL (StrR) nupG trfA tonA), where noted, were used as 
bacterial hosts for the circuits in Figs 1-3. 

Circuit characterization. Overnight cultures of E. coli strains were grown from 
glycerol freezer stocks at 37 °C, ina VWR 1585 shaking incubator at 300 r.p.m., in 
3 ml of Luria-Bertani-Miller medium (Fisher) with appropriate antibiotics: car- 
benicillin (50 pg m}), kanamycin (30 pg ml’), chloramphenicol (25 1g ml !). 
The inducers used were arabinose, isopropyl-B-p-1-thiogalactopyranoside and 
AHL 30C6HSL (Sigma-Aldrich). Where appropriate, CopyControl’* from 
Epicentre was added to overnight cultures at 1 active concentration. Over- 
night cultures were diluted 1:100 into 3 ml fresh Luria-Bertani medium and anti- 
biotics, and were incubated at 37°C and 300r-.p.m. for 20 min. Cultures (200 Ll) 
were then moved into 96-well plates, combined with inducers and incubated for 
4h 20min in a VWR microplate shaker (37°C, 700r.p.m.), until they had an 
attenuance of Déoo nm ~ 0.6-0.8. 

Cells were then diluted fourfold into a new 96-well plate containing fresh 
x1 PBS and immediately assayed using a BD LSRFortessa high-throughput 
sampler. Detailed characterization methods can be found in Supplementary 
Information. At least 50,000 events were recorded in all experiments, and these 
data were then gated by forward scatter and side scatter using CYFLOGIC v.1.2.1 
software (CyFlo). The geometric means of the gated fluorescence distributions were 
calculated using MATLAB. Fluorescence values were based on geometric means of 
flow cytometry populations from three experiments, each of which corresponded to 
50,000 events, and the error bars represent standard errors of the mean. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Strains and plasmids. All plasmids in this work were constructed using basic 
molecular cloning techniques (Supplementary Information). Escherichia coli 10B 
(araD139 A(ara-leu)7697 fhuA lacX74 galK (80 A(lacZ)M15) mcrA galU recAl 
endAl nupG rpsL (StrR) A(mrr-hsdRMS-mcrBC)) or E. coli EPI300 (F- mcrA 
A(mrr-hsdRMS-mcrBC) ®80dlacZAM15 AlacX74 recAl endA1 araD139 A(ara, 
leu)7697 galU galK A- rpsL (StrR) nupG trfA tonA), where noted, were used as 
bacterial hosts for the circuits in Figs 1-3. 

Circuit characterization. Overnight cultures of E. coli strains were grown from 
glycerol freezer stocks in a shaking incubator at 37 °C and 300r-.p.m. in 3 ml of 
Luria—Bertani-Miller medium (Fisher), with appropriate antibiotics: carbenicillin 
(50 pg ml), kanamycin (30 ugml’), chloramphenicol (25 ugml’). The indu- 
cers used were arabinose, IPTG and AHL 3OC6HSL (Sigma-Aldrich). Where 
appropriate, CopyControl’* from Epicentre was added to overnight cultures at X1 
active concentration. Overnight cultures were diluted 1:100 into 3 ml fresh Luria— 
Bertani medium and antibiotics, and were incubated at 37°C and 300r.p.m. for 
20 min. Cultures (200 pl) were then moved into 96-well plates, combined with 
inducers and incubated for 4h 20min in a VWR incubating microplate shaker 
(37 °C, 700 r.p.m.) until they had an attenuance of Dgoo nm ~ 0.6-0.8. 

Cells were then diluted fourfold into a new 96-well plate containing fresh 
X1 PBS and immediately assayed using a BD LSRFortessa high-throughput sam- 
pler. Detailed characterization methods can be found in Supplementary 
Information. At least 50,000 events were recorded in all experiments, and these 
data were then gated by forward scatter and side scatter using CYFLOGIC v.1.2.1 
software (CyFlo). The geometric means of the gated fluorescence distributions 
were calculated using MATLAB. Fluorescence values are based on geometric 


means of flow cytometry populations from three experiments and the error bars 
represent standard errors of the mean. 

When they varied, inducers were titrated across rows through serial dilutions. 

When an inducer was kept constant, it was added to cultures before they were 
moved to 96-well plates. 
Plate reader/FACS set-up. For each experiment, fluorescence readings were 
taken on a BioTek Synergy H1 Microplate Reader using BioTek GENS software 
to determine the minimum and maximum expression levels for cultures in each 
96-well plate. GFP fluorescence was quantified by excitation at a wavelength of 
484 nm and emission at a wavelength of 510 nm. mCherry fluorescence was quan- 
tified by excitation at 587 nm and emission at 610 nm. The gain of the plate reader 
was automatically sensed and adjusted by the machine. 

Cultures containing the minimum and maximum fluorescence levels, as deter- 
mined by the plate reader, were used to calibrate the fluorescein isothiocyanate and 
PE-TexasRed filter voltages on a BD LSRFortessa high-throughput sampler to 
measure GFP and mCherry expression levels, respectively. The FACS voltages 
were adjusted using BD FACSDIVA software so that the maximum and minimum 
expression levels could be measured with the same voltage settings. Thus, consist- 
ent voltages were used across each entire experiment. The same voltages were used 
for subsequent repetitions of the same experiment. GFP was excited with a 488-nm 
laser and mCherry was excited with a 561-nm laser. Voltage compensation for 
fluorescein isothiocyanate and PE-TexasRed was not necessary for any experi- 
ment. Supplementary Information, section 12, provides our FACS data. 
Plasmid maps and construction. Supplementary Information, sections 13 and 
14, provide details regarding plasmid construction and plasmid maps, respec- 
tively. The references in Supplementary Information, section 15, provide details 
regarding the origin of the plasmids. 
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BAF complexes facilitate decatenation of DNA by 


topoisomerase Ila 


Emily C. Dykhuizen’”*, Diana C. Hargreaves'**, Erik L. Miller'?, Kairong Cui*, Andrey Korshunov’, Marcel Kool’, 
Stefan Pfister®’, Yoon-Jae Cho®, Keji Zhao* & Gerald R. Crabtree!” 


Recent exon-sequencing studies of human tumours have revealed 
that subunits of BAF (mammalian SWI/SNF) complexes are 
mutated in more than 20% of all human malignancies'’, but the 
mechanisms involved in tumour suppression are unclear. BAF 
chromatin-remodelling complexes are polymorphic assemblies 
that use energy provided by ATP hydrolysis to regulate transcrip- 
tion through the control of chromatin structure’ and the placement 
of Polycomb repressive complex 2 (PRC2) across the genome*”. 
Several proteins dedicated to this multisubunit complex, including 
BRGI (also known as SMARCA4) and BAF250a (also known as 
ARID1A), are mutated at frequencies similar to those of recognized 
tumour suppressors. In particular, the core ATPase BRGI is mutated 
in 5-10% of childhood medulloblastomas®’ and more than 15% 
of Burkitt’s lymphomas’*"’. Here we show a previously unknown 
function of BAF complexes in decatenating newly replicated sister 
chromatids, a requirement for proper chromosome segregation 
during mitosis. We find that deletion of Brg1 in mouse cells, as well 
as the expression of BRG1 point mutants identified in human 
tumours, leads to anaphase bridge formation (in which sister chro- 
matids are linked by catenated strands of DNA) and a G2/M-phase 
block characteristic of the decatenation checkpoint. Endogenous 
BAF complexes interact directly with endogenous topoisomerase 
Ila (TOP2A) through BAF250a and are required for the binding 
of TOP2A to approximately 12,000 sites across the genome. Our 
results demonstrate that TOP2A chromatin binding is dependent 
on the ATPase activity of BRG1, which is compromised in oncogenic 
BRGI mutants. These studies indicate that the ability of TOP2A to 
prevent DNA entanglement at mitosis requires BAF complexes and 
suggest that this activity contributes to the role of BAF subunits as 
tumour suppressors. 

Using Brg1*e"xe4 (Brg1™") actin-creER mouse embryonic stem 
cells (ESCs), we observed that tamoxifen-induced deletion of Brg1 
(Brgl "Vfl ERA)) results in the appearance of DNA bridges during ana- 
phase (Fig. 1a). This phenotype is characteristic of a deficiency in TOP2A 
function, which normally resolves DNA catenanes that develop during 
transcription and replication’. We determined the frequency of ana- 
phase bridges in Brg1!"(ERA) cells to be similar to that of cells deficient 
in other putative tumour suppressors that regulate TOP2A function, 
including BRCA1, RANBP2 and RECQLS (refs 13-15) (Fig. 1a). 

In previous studies, we recovered peptides from TOP2A in mass 
spectrometric analysis of endogenous BAF complexes’®. Immunoprecipi- 
tation of BAF complexes with antibodies to BRG1 recovered TOP2A 
and, conversely, immunoprecipitation of TOP2A revealed BRGI (Fig. 1b). 
This association is not dependent on DNA (Supplementary Fig. 1a, b). 
We detected this association in several additional cell types, including 
mouse embryonic fibroblasts (MEFs) and human embryonic kidney 
(HEK293T) cells (Supplementary Fig. 1c). 


Failure of TOP2A to resolve catenated DNA leads to slow progression 


through the G2/M phase of the cell cycle”. To understand the mitotic 
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Figure 1 | BRG1 associates with TOP2A and regulates its function. a, 
Anaphase bridges in Brg” and Brg1"/"(ERA) ESCs. Data represent number of 
slides from four independent experiments + s.e.m. b, Co-immunoprecipitation 
of BRG1 and TOP2A from nuclear lysates. c, DNA content in Brg!" and 
Brg1"/"(ERA) ESCs after release from double thymidine block. 7-AAD, 
7-aminoactinomycin D. d, Cell cycle analysis of Brg] and Brg1/"(ERA) ESCs 
with or without caffeine, an ATM/ATR inhibitor. e, Data represent five 
independent histone H3 serine 10 phosphorylation (H3(S10)P) cell cycle 
analyses + s.e.m. f, Data represent the mean of the average chromosome length 
per cell from 40 cells from metaphase spreads of BrgI”" and Brg1"(ERA) MEFs 
from two independent experiments. 
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Brgl’"(ERA) cells in G1/S using double thymidine block. After release, 
Brgl™" and Brg1""(ERA) cells transited through the cell cycle at the 
same rate until reaching G2/M, where the Brg1"”"(ERA) cells exhibited 
a substantial delay (Fig. 1c). In asynchronously dividing cells, this delay 
resulted in a 1.5- to twofold increase in Brg AV/l(ERA) cells in G2/M 
(Fig. 1d), similar to the treatment of cells with the topoisomerase II 
inhibitor ICRF-193 (ref. 18). Caffeine, an inhibitor of ATM/ATR, 
forces cells through an ICRF-193-induced decatenation checkpoint'® 
and similarly overrides the G2/M arrest in Brg1 A/V ERA) cells (Fig. 1d). 
Furthermore, expression of TOP2A(S1524A) (hereafter TOP2A(SA)), 
which fails to recruit MDC1 to chromatin upon initiation of the 
decatenation checkpoint”, alleviated the cell cycle delay, suggesting 
that Brgl™’ (PRA) cells arrest due to activation of the decatenation 
checkpoint (Fig. le and Supplementary Fig. 1d). Finally, chromosomes 
from Brg1"/"(ERA) cells are significantly longer than chromosomes 
from Brgl"" cells (Fig. 1f and Supplementary Fig. le), a defect also 
observed in TOP2A-deficient cells owing to its role in chromosome 
condensation’*”’. These data indicate that Brg1 deletion resembles the 
mitotic defects observed with chemical inhibition and/or knockdown 
of TOP2A 12171820, 

We investigated oncogenic point mutants of BRGI found in 
medulloblastoma and Burkitt's lymphoma (BRG1(G1232D), hereafter 
BRGI(GD); and BRG1(T910M), hereafter BRG1(TM))*"' by expres- 
sing Flag-tagged versions in Brg1"”” cells (Fig. 2a). The BRG1 mutants 
were incorporated into the BAF complex normally and did not alter the 
composition of the complex (Supplementary Fig. 2a). Although the 
T910M mutation is located in the ATP-binding pocket of BRG1, 
the G1232D mutation is downstream of the helicase superfamily 
carboxy-terminal (HELICc) domain and thus not obviously involved 
in ATP turnover (Fig. 2a). Subjecting BAF complexes containing 
BRG1(GD), BRG1(TM), wild-type BRG1 (hereafter WT BRG1) or 
BRGI1(K798R) (hereafter BRG1(KR))”"—the ATPase-dead point 
mutant of BRG1—to an assay for ATPase activity revealed that both 
cancer mutants are significantly compromised in ATPase activity, 
although not as profoundly as BRG1(KR) (Fig. 2b). BRG1(TM) is more 
severely compromised than BRG1(GD), which correlated with the 
viability of the respective cell line (Supplementary Fig. 2b). 

Cells expressing BRG1(GD) and BRG1(TM), but not WT BRGI, 
display increases in the percentage of G2/M cells and the incidence of 
anaphase bridges similar to that of Brg1 AV ERA) cells (Fig. 2c, d). 
Importantly, expression of the mutants in the presence of endogenous 
BRGI gives similar increases, although less severe, in G2/M percentage 
and anaphase bridge incidence compared to vector cells (Fig. 2c, d). 
The dominant nature of these mutants on cell cycle and anaphase- 
bridge formation suggests that medulloblastomas with both heterozyg- 
ous and homozygous mutations in BRG1 have these mitotic defects. 

To explore whether the increase in anaphase bridges contributes to 
increased chromosome instability as it does in TOP2A-deficient cells'*””, 
we analysed ploidy in the BRG1 mutant cell lines. Expression of 
BRG1(GD) or BRG1(TM) results in a significant increase in cells with 
>4n DNA content in both ethanol- and tamoxifen-treated cells (Fig. 2c 
and Supplementary Fig. 2c). We also observed a significant increase in 
the number of BRG1(GD)- and BRG1(TM)-expressing cells with ab- 
normal chromosome number in metaphase spreads from both ethanol- 
and tamoxifen-treated samples (Fig. 2e). These data indicate that the 
G1232D and T910M mutations in BRG1 can contribute to chro- 
mosome instability as a result of deficiencies in TOP2A function. 

We collected several BRG1 mutant medulloblastomas to determine 
whether the effects of TOP2A deficiency can be observed in primary 
tumours. We observed an increased proportion of anaphase bridges in 
each of five histologic samples from BRG1 mutant tumours relative to 
controls, suggesting that these tumours have decatenation defects 
(Fig. 2f and Supplementary Fig. 2d). Aneuploidy is common in medul- 
loblastoma and ranges from the partial gain or loss of single chromosomes 
to full tetraploidy*’*”. However, a recent study showed that the relative 
rate of tetraploidy of 5 BRG1 mutant tumours was similar to that of 15 
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Figure 2 | Expression of medulloblastoma-associated BRG1 mutants 
phenocopy TOP2A inhibition. a, Somatic mutations in BRG1 found in 
medulloblastoma. BRK, Brm and KIS domain; BROMO, bromodomain; 
DEXD/H, DEAD/DEAH box helicase domain; HAS, helicase SANT-associated 
domain. b, The DNA-stimulated ATPase activity of BAF complexes from 
WT BRGI-, BRG1(GD)-, BRG1(KR)-, BRG1(TM)- and vector control- 
expressing Brg1""(ERA) cells + s.e.m. ¢, Cell cycle analysis of Brg1/" MEFs 
expressing WT BRG1, BRG1(GD), BRG1(TM) or vector, treated with ethanol 
or tamoxifen (Tax). d, Cells were prepared as in c and the mean frequency of 
anaphase bridges + s.e.m. from three independent experiments was measured. 
e, Cells were prepared as in c and collected for metaphase spreads. The number 
of cells with greater than 40 chromosomes (AT) was quantitated from >50 
cells. Significance was calculated relative to vector control ethanol-treated 
Brgl™" cells where *P < 0.05, **P< 0.01, ***P < 0.0001. f, Various tissues 
were sectioned and scored for the number of anaphase bridges of total 
anaphases. MB, medulloblastoma; S, medulloblastoma subtype. 


BRGI wild-type tumours®. Additional sample characterization will be 
necessary to definitively assess whether BRG1 mutation causes mitotic 
defects through insufficient TOP2A function in medulloblastomas. 
Microarray analysis of Brg1"’" and Brg" (ERA) ESCs indicated 
that BRG1-dependent genes are not enriched for Gene Ontology terms 
related to DNA damage or repair™*. Additionally, we found no altera- 
tions in the abundance, post-translational modifications, cellular local- 
ization, or in vitro enzymatic activity of TOP2A in BrgI"(ERA) cells 
(Supplementary Figs 3a-f and 4). To test whether purified BAF com- 
plexes could enhance the enzymatic activity of recombinant TOP2A, 
we used the standard in vitro kinetoplast DNA-based decatenation 
assay. Immobilized BAF complexes increased the enzymatic activity 
of TOP2A (Supplementary Fig. 5a); however, immunoprecipitated 
BRG1-mutant BAF complexes also enhanced TOP2A activity, as did 
immunoprecipitated SUZ12-containing PRC2 complexes, indicating a 
nonspecific activity on a bare DNA template” that does not reflect our 
in vivo observations. The BRG1 mutants did, however, reduce the 
association between TOP2A and chromatin, such that more TOP2A 
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remained associated with chromatin after high salt wash in WT BRG1 
cells than in BRG1(TM), BRG1(GD) and vector cells (Fig. 3a and 
Supplementary Fig. 5b, c). Reduced binding of TOP2A to chromatin 
would be expected to compromise TOP2A function and could repres- 
ent an inability of TOP2A to associate with substrate DNA during 
decatenation. 

To identify defined regions of TOP2A binding across the genome, 
we performed a TOP2A chromatin immunoprecipitation combined 
with massively parallel DNA sequencing (ChIP-seq) in Brgi"’" and 
Brgl"'"(ERA) cells. We recovered very few peaks using traditional 
ChIP methods, so we used etoposide, a small molecule that freezes 
TOP2A ina covalent complex with DNA during the enzymatic process, 
thereby identifying sites of active TOP2A cleavage”. We recovered 
16,591 TOP2A peaks in Brgl’ 1 Cells and 4,623 TOP2A peaks in 
Brg1"!"(ERA) cells, demonstrating the contribution of BRG1 to TOP2A 
binding (Fig. 3b). Almost two-thirds of the TOP2A Brg1"" peaks are 
DNase-I-hypersensitive, consistent with the preference of TOP2A for 
nucleosome-free DNA”. An example reflecting these trends is shown 
in Fig. 3c. We confirmed TOP2A binding by ChIP-quantitative PCR 
(qPCR) at 14 BRG1-dependent and 10 BRG1-independent sites in 
Brg1"" and Brgl"(ERA) cells (Fig. 3d). In addition, we determined 
that TOP2A binding is mitigated in BRGI(TM) and BRG1(GD) 
mutant Brgl"!"(ERA) cells at BRG1-dependent sites (Fig. 3e). This is 
not the result of reduced binding of the BRG1 mutants to chromatin, as 
BRG1(TM) and BRG1(GD) bind similarly to WT BRG1 at these sites 
(Fig. 3f). Given that the BRG1(TM) and BRG1(GD) mutants display 
reduced ATPase activity, these data implicate a role for the ATP- 
dependent accessibility activity of BAF complexes in TOP2A binding 
and function across the genome, a function previously identified for 
yeast protein Snf5 in transcription”®. 

Owing to the dedicated nature of subunits within BAF complexes, 
TOP2A could be interacting with any BAF subunit. Indeed, we pre- 
cipitated TOP2A with antibodies to several dedicated subunits as 
determined by glycerol gradient centrifugation analysis (Fig. 4a and 
Supplementary Fig. 6a). Quantification of the precipitated TOP2A 
revealed that little TOP2A was recovered after immunoprecipitation 
with antibodies raised against BAF250a (amino acids 1236-1325) and 
BAF250b (also known as ARID1B; amino acids 1300-1350), whereas 
other antibodies immunoprecipitated TOP2A well (Fig. 4a). We reasoned 
that the BAF250a/b antibody might disrupt the interaction between 
TOP2A and the BAF complex if TOP2A bound directly to BAF250a/b. 
Indeed, TOP2A associated with full-length BAF250a and BAF250a 
(amino acids 1-1758), but not BAF250a (amino acids 1759-2285) in 
a heterologous expression system (Fig. 4b). This interaction is inde- 
pendent of BRG1 because we were unable to detect BRG1 in co-precipitates 
of BAF250a (1-1758) and TOP2A. Furthermore, the association between 
TOP2A and BRG1 was lost upon knockdown of BAF250a, with the 
most severe knockdown resulting in the most severe loss of association 
(Fig. 4c and Supplementary Fig. 6b). To determine whether the inter- 
action between TOP2A and BAF250a was physiologically relevant, we 
knocked down BAF250a in MEFs and observed frequencies of ana- 
phase bridges and G2/M delay similar to knockdown of Brg or Top2a 
(Fig. 4d, e and Supplementary Fig. 6c, d). These data indicate that TOP2A 
associates with BRG1 through a direct interaction with BAF250a. 

Our studies point to a new role for ATP-dependent chromatin 
remodelling in decatenating DNA. Reduced decatenation in vivo is 
revealed by the frequency of anaphase bridges and an increase in the 
number of cells in G2/M upon deletion of BRG1 or expression of the 
tumour-associated T910M and G1232D BRG1 mutants (Figs la, d and 
2c, d). Although mitotic defects have been noted in cells lacking BRG1, 
the cause of these defects was unclear”’. In addition to BRG1, loss of 
BAF250a also results in decatenation defects (Fig. 4d, e), which could 
reflect the high incidence of mutations in BRG1 and BAF250a in 
human tumours’”*. Our in vivo observations are reinforced by the 
requirement of BAF for TOP2A binding at DNase-I-hypersensitive 
BRGI binding sites (Fig. 3b-e). 
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Figure 3 | BRG1 facilitates the binding of TOP2A to chromatin in vivo 
through ATPase-dependent chromatin-remodelling activity. a, Chromatin 
pellets isolated from nuclei of WT BRG1-, BRG1(GD)-, BRG1(TM)- and 
vector-expressing BrgI""(ERA) ESCs lysed in + 500 mM NaCl. b, The number 
of DNase-I-hypersensitive (HS) TOP2A peaks of the total number of TOP2A 
peaks from TOP2A ChIP-segs in Brg1™/" and Brgl""(ERA) ESCs. 

c, Representative ChIP-seq tracks for TOP2A (in BrgI"’" and Brgl”"(ERA) 
ESCs), BRG1 and DNase I hypersensitivity. d, TOP2A ChIP-qPCR 
confirmation from Brg1"/" and Brgl""(ERA) ESCs. e, TOP2A ChIP-qPCR 
confirmation from WT BRG1-, BRG1(GD)-, BRG1(TM)- and vector control- 
expressing Brgl"(ERA) ESCs. f, BRG1 ChIP-qPCRs from WT BRG1-, 
BRGI1(GD)-, BRG1(TM)- and vector control-expressing Brgl™ (ERA) ESCs. 
Data in e and f represent means of triplicate experiments + s.e.m. 


The dependence of TOP2A on BAF function offers a possible 
explanation for the frequency with which BAF subunit mutations 
are detected in screens for driving mutations in human cancers. 
Anaphase bridges are often forcibly severed during cytokinesis”, 
resulting in partial or complete chromosome gains or losses as well 
as polyploidy if the cell fails to undergo mitosis'*'’. At present, the 
number of BRG1-mutant medulloblastomas analysed for ploidy status 
is insufficient to determine whether BRG1 mutation results in aneu- 
ploidy in human tumours. In the case of medulloblastoma, mutations 
in BRGI are often accompanied by activating mutations in the WNT 
signalling pathways and/or MYC amplification”’. Further studies 
highlighting these pairings will help define the contribution of reduced 
TOP2A function as a result of BRG1 mutation to tumorigenesis. 
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Figure 4 | TOP2A associates with the BAF complex through BAF250a. 

a, Immunoprecipitates from ESC nuclear lysates. Quantification of the 
precipitated TOP2A is shown. b, V5 was precipitated from HEK293T cells that 
had been transfected with Flag~TOP2A and either vector, V5-tagged full- 
length BAF250a (BAF250a FL) or V5-tagged BAF250a fragments. Lysates and 
anti-V5 precipitates were blotted for anti-V5, anti-Flag and anti-BRG1. 

c, BRG1 was immunoprecipitated from ESCs after BAF250a knockdown. 

d, MEFs with knockdown of Brg1, BAF250a or Top2a. Anaphase-bridge 
frequency is calculated for seven experiments + s.e.m. Significance was 
calculated relative to vector control cells where *P < 0.05, **P < 0.01, 

***P < (),0001. e, Cell cycle analysis of MEFs from d. Data represent the mean 
of the percentage of G2/M cells normalized to vector control from four 
experiments + s.e.m. 


METHODS SUMMARY 


Brgl deletion from Brgl "8 creER ESCs and MEFs was performed as previously 


described*. Lentiviruses were produced in HEK293T cells using polyethylenimine 
transfection. Cells were synchronized using double thymidine block. Cell cycle 
analysis was performed according to manufacturer instructions (BD Biosciences). 
TOP2A ChIP-seq was performed following etoposide fixation’®. qPCR, immuno- 
fluorescence, immunoprecipitation and western blotting were done using standard 
protocols. The chromatin fraction from nuclei incubated in varying concentra- 
tions of NaCl was analysed by western blot. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 
pia 


Brgl deletion from Brgl"’" creER ESCs and MEFs was performed as previously 
described*. Lentiviruses were produced in HEK293T cells using polyethylenimine 
(PEI) transfection. Cells were synchronized using double thymidine block. Cell 
cycle analysis was performed according to manufacturer instructions (BD 
Biosciences). TOP2A ChIP-seq was performed following etoposide fixation”. 
qPCR, immunofluorescence, immunoprecipitation and western blotting were 
done using standard protocols. The chromatin fraction from nuclei in varying 
concentrations of NaCl was analysed by western blot. 

Immunofluorescence. To quantify anaphase bridges, cells were fixed with 4% para- 
formaldehyde for 20 min, washed and stained with 4’,6-diamidino-2-phenylindole 
(DAPI; Sigma). The number of anaphases/telophases with bridges over the total 
number of anaphases (between 56 and 187 total anaphases per 25-mm slide) was 
recorded from each slide for more than four independent experiments. Brg” and 
Brgl"(ERA) ESCs were visualized with DAPI 72 h after tamoxifen treatment. Wild- 
type MEFs were infected with lentiviruses containing short hairpin RNA (shRNA) to 
Brg1, BAF250a or Top2a and analysed 48-96 h after infection. 

To stain for TOP2A and centromeres/microtubules, cells were blocked with 5% 
BSA/1% goat serum in phosphate buffered saline with 0.1% Tween-20 (PBST) for 
1h following fixation and incubated with anti-centromere (Antibodies Inc.) or 
anti-y-tubulin (Sigma) and anti-TOP2A (Santa Cruz) for 2 h. After several washes, 
anti-human Alexa Fluor 488 and anti-rabbit Alexa Fluor 568 were added for 1h. 
The cells were then stained with DAPI for 10 min and washed 3X PBS for 10 min 
each. The coverslips were mounted on slides with Vectashield Hard Mount 
(Vector Labs). 

To quantify anaphase bridges from paraffin-embedded human tumour sam- 
ples, slides were incubated 2 X 25 min in xylenes, then rehydrated in 100% eth- 
anol, then 95% ethanol, then water, for 2 min each. The slides were boiled in citrate 
buffer (pH 6.0) (Vector Labs) for 20 min and washed 2 X 5 min in PBS-Tween. 
The slides were then stained with DAPI for 10 min and washed 3 X 5 min with PBS 
before mounting with Vectashield Hard Mount. 

Cell synchronization. ESCs were incubated with 2mM thymidine for 7-8 h, 
released into fresh media for 7h, and then incubated with thymidine again for 
7h. The cells were washed several times with PBS, released into fresh media, and 
collected at time points thereafter. 

Cell cycle analysis. The cell cycle analysis was performed using BD Biosciences 
BrdU-FITC FACS kit. ESCs were incubated with BrdU for 1h and MEFs were 
incubated with BrdU for 4h. Brgl™” and Brgl'”"(ERA) ESCs were analysed 72h 
after tamoxifen treatment. Caffeine was added to media 2h before BrdU incuba- 
tion. To determine the per cent of cells in G2/M, DNA was stained with 7-AAD 
and analysed by FACS. 

H3(S10)P cell cycle analysis. Brg/" ESCs were infected with interference-RNA- 
resistant wild-type human TOP2A or TOP2A(S1524A) and shRNAs to mouse 
Top2a. Cells were stained with anti-H3(S10)P and analysed by flow cytometry 
72h after treatment with or without tamoxifen. 

Metaphase spread preparation. MEFs were grown to 85% confluence and incu- 
bated for 4 h with colcemid. Cells were collected and swelled by drop-wise addition 
of 1:1 0.4% KCI/0.4% sodium citrate for 7 min at 37 °C. Cells were then fixed by 
dropwise addition of 3:1 methanol/acetic acid for 20 min, spun down and fixed for 
another 30 min. Metaphases were dropped onto slides, dried on wet paper towels 
and stained with DAPI for visualization. Chromosomes were then measured and 
counted using ImageJ software. To analyse polyloidy, only cells with greater than 
35 chromosomes were counted to eliminate artefacts due to partial spreads. 
Gene expression profiling and analysis. RNA was isolated using TRIzol 
(Invitrogen) and reverse transcribed into complementary DNA using Super- 
Script III reverse transcriptase (Invitrogen). qPCR was performed on the 
StepOnePlus (ABI) machine using FastStart Universal SYBR Green Master with 
ROX (Roche). 

Immunoprecipitation. Nuclei were isolated from cells with buffer A (25 mM 
HEPES, pH7.6, 5mM MgCl, 25mM KCI, 0.05mM EDTA, 10% glycerol, 
0.1% NP-40) and lysed for 30 min in immunoprecipitation buffer (50 mM Tris- 
HCl, pH 8.0, 150mM NaCl, 0.1% NP-40). The chromatin was removed using 
centrifugation and the lysates were pre-cleared with 20 ul Protein A or Protein 
G Dynabeads for 30 min. The protein concentration was quantitated using the 
bicinchoninic acid (BCA) assay (Pierce) and adjusted to a final volume of 200 1] at 
a final concentration of 1.5mgml~' with immunoprecipitation buffer. Each 
immunoprecipitate was incubated with 3 ug of anti-BRG1 (Santa Cruz), anti- 
TOP2A (Abcam), anti-BAF45d (Crabtree laboratory), anti-BAF47 (Santa Cruz), 
anti-BAF57 (Bethyl), anti-BAF155 (Crabtree laboratory), anti-BAF60a (BD 
Transduction Laboratories), anti-BAF250a (Santa Cruz), anti-BAF180 (Bethyl), 
anti-BAF250b (Santa Cruz, Bethyl), anti-SS18 (Santa Cruz) anti-BAF200 (Santa 
Cruz) or anti-IgG (Santa Cruz) overnight at 4°C and then for 2h with 20 ul 


Protein A/G Dynabeads. The beads were washed four times with 1 ml immuno- 
precipitate buffer and re-suspended in 10 pl gel loading buffer (4 lithium dodecyl 
sulfate buffer; Invitrogen). 

Glycerol gradient centrifugation analysis. ESCs were lysed in buffer A (10 mM 
HEPES, pH 7.6), 25 mM KCl, 1 mM EDTA, 10% glycerol, 1 mM DTT and protease 
inhibitors (complete mini tablets (Roche) supplemented with 1mM _ phenyl- 
methylsulphonyl fluoride) on ice. Nuclei were sedimented by centrifugation 
(1,000g), re-suspended in buffer C (10mM HEPES, pH7.6), 3mM MgCl, 
100 mM KCl, 0.1 mM EDTA, 10% glycerol, 1 mM dithiothreitol (DTT) and prote- 
ase inhibitors), and lysed by the addition of ammonium sulphate to a final con- 
centration of 0.3M. Soluble nuclear proteins were separated by insoluble 
chromatin fraction by ultracentrifugation (100,000g) and precipitated with 
0.3mgml_' ammonium sulphate for 20 min on ice. Protein precipitate was iso- 
lated by ultracentrifugation (100,000g) and re-suspended in HEMG-0 buffer 
(25mM HEPES, pH 7.9, 0.1mM EDTA, 12.5mM MgCl, 100 mM KC\l) for gly- 
cerol gradient analyses. 800 1g of protein was overlaid on to a 10-ml 10-30% 
glycerol (in HEMG buffer) gradient prepared in a 14 X 89 mm polyallomer cent- 
rifuge tube (Beckman). Tubes were placed in a SW-40 swing bucket rotor and 
centrifuged at 4°C for 16h at 40,000 r.p.m. 0.5-ml fractions were collected and 
used in gel electrophoresis and subsequent western blotting analyses. 

Western blots. Nuclei were isolated from cells with buffer A (25 mM HEPES, 
pH7.6, 5mM MgCh, 25mM KCI, 0.05 mM EDTA, 10% glycerol, 0.1% NP-40) 
and lysed for 30min in RIPA buffer (10mM Tris, pH7.4, 150mM NaCl, 
0.1% SDS, 0.5% sodium deoxycholate, 1% Triton X-100, 1mM EDTA). 
Chromatin was either spun out or samples were sonicated before BCA analysis. 
Equal amounts of protein were boiled in gel loading buffer and loaded onto 4-10% 
BisTris NuPage gels. After transfer, blots were blocked in 5% BSA and incubated with 
anti-BRG1, anti-TOP2A, anti-phosphotyrosine (Millipore), anti-phosphoserine 
(Millipore) or anti-ubiquitin (Santa Cruz). Proteins were detected using the LI- 
COR detection system or enhanced chemiluminescence (ECL)/autoradiography 
for SUMO-TOP2A. 

Chromatin association assay. Nuclei were isolated using buffer A and re-suspended 
in 20 mM Tris-HCl, pH 7.6, 3 mM EDTA at 60 million cells per ml. Samples of 25 11 
were alloquotted into tubes and NaCl concentrations were adjusted to a final volume of 
50 pl. Samples were gently mixed and incubated on ice for 20 min and centrifuged at 
high speed for 20 min to isolate chromatin. The lysate was removed and the chromatin 
pellet was re-suspended in 120 pl gel loading dye. The pellet was solubilized using 
sonication and the association of TOP2A to chromatin was analysed using western 
blotting. 

ATPase assay. ATPase assay was adapted from the literature*'. Immuno- 
precipitations were performed as described above with anti-BRG1 antibody. 
Immunoprecipitates were washed a final time with 10mM Tris-HCl, pH7.5, 
50mM NaCl, 5mM MgCl, 1mM DTT and re-suspended in 20 pl assay buffer 
(10 mM Tris-HCl, pH 7.5, 50mM NaCl, 5 mM MgCl, 20% glycerol, 1 mg ml! 
BSA, 0.5mM ATP, 20nM plasmid DNA, 50Ciml! y-**P ATP, 1mM DTT, 
protease inhibitors). The reaction was agitated at 37 °C for 1 h (or when approxi- 
mately 50% of ATP was converted to inorganic phosphate). Reaction mixture 
(0.5 pl) was spotted onto PEI cellulose plates and thin layer chromatography 
was performed in 0.5M LiCl and 1M formic acid. The plates were dried and 
imaged using phosphorimaging. The enzymatic activity was quantitated as a ratio 
of product ([?P]P;) to starting material ([y-**P]ATP). Values were normalized to 
the activity of wild-type BRG1 (100%) and vector control (0%) cells. 
Chromatin immunoprecipitation. For the BRG1 ChIP, 40 million ESCs were 
fixed for 12 min in 1% formaldehyde at room temperature (25 °C). Nuclei were 
sonicated in 1 ml ChIP lysis buffer (50 mM HEPES, pH 7.5, 150mM NaCl, 2mM 
EDTA, 1% Triton X-100, 0.1% SDS) to yield fragments between 200-500 base 
pairs. Five-hundred microlitres of lysate was incubated with 5 1g of anti-BRG1 
(Crabtree laboratory) or 5 ug anti-rabbit IgG and rotated overnight at 4 °C and 
then for 2h with 20 pl Protein A/G Dynabeads. After five washes with ChIP lysis 
buffer and one wash in 10 mM Tris-HCl, pH 8.0, 1 mM EDTA (TE), DNA was 
eluted by boiling in 10% Chelex slurry. 

The etoposide ChIP of TOP2A was adapted from the literature’. Specifically, 
20 million ESCs were treated with 100 [1M etoposide for 10 min. Cells were washed 
once with PBS and lysed with 1 ml of a buffer containing 1% Sarkosyl, 10 mM Tris- 
HCl, pH 7.5, 10mM EDTA and protease inhibitor. A solution of 7M CsCl (7 M) 
was added to a final concentration of 0.5 M and the lysate was sonicated to yield 
fragments between 200-500 base pairs. ChIP buffer (300 ul) was added to 300 ul of 
lysate for a final concentration of 50 mM HEPES, pH 7.5, 300mM NaCl, 1 mM 
EDTA, 1% Triton X-100, 0.1% DOC, and 0.1% SDS and 3 jig anti- TOP2A pre- 
bound to 20 ul Protein G Dynabeads was added. The lysate was rotated overnight 
at 4°C and washed four times with ChIP lysis buffer, one time with LiCl buffer 
(10 mM Tris, pH 8.0, 0.25 M LiCl, 0.5% NP-40, 0.5% DOC, 1 mM EDTA) and one 
time with TE. The DNA was eluted with 300 tl of 1% SDS, 0.1 M NaHCO; for 
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20 min and removed from the beads. The solution was adjusted to 200 mM NaCl, 
10 mM EDTA, 40 mM Tris, pH 6.5, and 0.2 mg ml ' RNase A was added for 30 min 
at 37 °C. Proteinase K was added to 0.03 mg ml * and digested overnight at 55 °C. 
The DNA was extracted with phenol/chloroform and precipitated with ethanol for 
analysis by qPCR. Primers used for ChIP-qPCR are available upon request. 
ChIP-seq and analysis. The library preparation and sequencing was performed as 
previously described*’. Raw ChIP-seq reads were mapped to the Mus musculus 
genome (build mm9/NCBI37) using the short-read aligner Bowtie (version 
0.12.7). Peaks were then called using Model-base analysis of ChIP-seq (MACS) 
(version 1.4.1)°**. Further analysis was aided by the Bedtools suite (version 2.16.2)”. 
Genome annotations were acquired from the UCSC Genome Browser (http:// 
genome.ucsc.edu/)***’. We also uploaded our data to the genome browser, which 
was used to produce screenshots of chromatin binding/modification profiles at 
individual loci. 

Topoisomerase activity assay. Reactions contain: 150ng kinetoplast DNA 
(Topogen), 50 mM Tris-HCl, pH 8.0, 150 mM NaCl, 10 mM MgCl, 2mM ATP, 
a standard TOP2A immunoprecipitate or varying amounts of recombinant 
TOP2A (Topogen). 

Lentiviral infection. HEK293T cells were transfected with lentiviruses containing 
vector alone, wild-type BRG1, BRG1 point mutants, wild-type human TOP2A or 
human TOP2A(S1524A), or with vectors containing shRNA to Brgl (clone ID 
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TRCN0000071386), Aridla (clone ID TRCN0000071395, Origene) or Top2a 
(clone ID V2LMM_11295). After 48h, supernatants were collected and centri- 
fuged at 20,000 r.p.m. for 2h. Viral pellets were re-suspended in PBS and used to 
infect ESCs in suspension or MEFs by spinfection. Cells were selected with pur- 
omycin and collected 48-96 h after infection for analysis. 
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Specialized filopodia direct long-range transport of 
SHH during vertebrate tissue patterning 


Timothy A. Sanders'**, Esther Llagostera'*“* & Maria Barna’?* 


The ability of signalling proteins to traverse tissues containing 
tightly packed cells is of fundamental importance for cell specifica- 
tion and tissue development; however, how this is achieved at a 
cellular level remains poorly understood". For more than a century, the 
vertebrate limb bud has served as a model for studying cell signalling 
during embryonic development’. Here we optimize single-cell real- 
time imaging to delineate the cellular mechanisms for how signalling 
proteins, such as sonic hedgehog (SHH), that possess membrane- 
bound covalent lipid modifications traverse long distances within 
the vertebrate limb bud in vivo. By directly imaging SHH ligand 
production under native regulatory control in chick (Gallus gallus) 
embryos, our findings show that SHH is unexpectedly produced in 
the form of a particle that remains associated with the cell via long 
cytoplasmic extensions that span several cell diameters. We show 
that these cellular extensions are a specialized class of actin-based 
filopodia with novel cytoskeletal features that have not been previously 
described. Notably, particles containing SHH travel along these 
extensions with a net anterograde movement within the field of 
SHH cell signalling. We further show that in SHH-responding cells, 
specific subsets of SHH co-receptors, including cell adhesion mole- 
cule downregulated by oncogenes (CDO) and brother of CDO (BOC), 
actively distribute and co-localize in specific micro-domains within 
filopodial extensions, far from the cell body. Stabilized interactions 
are formed between filopodia containing SHH ligand and those 
containing co-receptors over a long range. These results suggest that 
contact-mediated release propagated by specialized filopodia con- 
tributes to the delivery of SHH at a distance. Together, these studies 
identify an important mode of communication between cells that 
considerably extends our understanding of ligand movement and 
reception during vertebrate tissue patterning. 

The regulated movement of key signalling proteins within tissues is 
a central feature of metazoan development that remains poorly under- 
stood at the cellular level’. Several mechanisms have been proposed to 
distribute signalling molecules, including free diffusion, transcytosis 
and directed transport of signalling receptors via filopodia, including 
those termed cytonemes within invertebrate embryos'*>. An example 
of a key signalling protein is SHH, which is instrumental in patterning 
the early embryo®. During limb development, SHH is produced by the 
zone of polarizing activity (ZPA), a small group of mesenchymal cells 
at the posterior margin of the limb bud, and it acts over a long range to 
specify the number and identity of digits produced’. How tight control 
in SHH distribution across several cell diameters is established remains 
poorly understood. Here we combine unique genetic and live-cell 
imaging approaches to investigate the mechanisms of long-range cell 
signalling underlying tissue patterning during vertebrate embryonic 
development. 

To image living chick embryos under the regulatory control of specific 
spatial and temporal regulatory elements, we developed a piggyBac 
transposon-mediated stable integration approach®. This genetically tract- 
able expression system, coupled with optimized confocal microscopy, 


enabled imaging of embryos (Methods) at single-cell and subcellular 
resolution (Fig. la and Supplementary Fig. 1). At first, we expressed 
membrane-tethered fluorescent proteins that illuminate individual 
mesenchymal cells within the developing limb bud in a mosaic fashion 
(Fig. 1b). Through this approach we uncovered an unexpected, intric- 
ate network of thin cellular extensions present on these cells spanning 
several cell diameters, which orient in many directions from the cell 
body along the anterior—posterior, proximal-distal and dorsal-ventral 
axis (Fig. 1c-e and Supplementary Fig. 2). All mesenchymal cells pos- 
sess several cellular extensions up to 150 1m long, with an average 
length of 34.27 + 9.6 um (mean + s.e.m.) ( = 24), revealing a surpris- 
ing morphology to these cells (Fig. lc-e and Supplementary Video 1). 
These extensions are remarkably fine, approximately 200 nm in diameter, 
at the resolving limit of conventional microscopy and can be labelled 
with membrane-bound (Fig. 1) but not cytoplasmic (Supplementary 
Fig. 3) fluorescent proteins. They are capable of elongating, ata maximum 
rate of 150nms_, retracting, and traversing the complex three-dimensional 
extracellular matrix of the limb bud (Fig. 1g, h, Supplementary Fig. 2 
and Supplementary Videos 2 and 3). Importantly, despite their 
dynamic nature, they form highly stabilized long-range interactions 
between cells, thereby revealing a new and complex landscape of cell- 
cell interactions mediated through cytoplasmic extensions within 
embryonic tissues (Fig. 1f, Supplementary Fig. 2 and Supplementary 
Videos 4 and 5). These extensions are not amenable to conventional 
fixation, which severely disrupts their structure and may have pre- 
cluded their previous identification (Supplementary Fig. 5). 

Actin-associated markers—including the high-affinity F-actin probe 
utrophin calponin homology domain (UCHD) fused to enhanced green 
fluorescent protein (UCHD-eGFP) and moesin-eGFP—decorate the 
entire length of cytoplasmic extensions, revealing that these structures 
are actin-based filopodia (Fig. 2a and Supplementary Fig. 4a). Unlike 
actin markers, tubulin cytoskeleton markers, such as microtubule- 
associated proteins TAU (also known as MAPT) and EB3, only label 
the proximal base of a subset of mesenchymal filopodia (data not shown). 
We next examined the localization of a plus-ended actin motor, myosin- 
X°. Notably, myosin-X-eGFP moves to the distal tips of the filopodia, 
where it accumulates, thereby revealing that actin motors can move 
along these structures (Fig. 2b and data not shown). 

Limb bud mesenchymal cytoplasmic extensions also possess distinct 
cytoskeletal features compared to typical filopodia, commonly characterized 
as actin-based linear extensions of the cell membrane, with limited 
lengths of up to 10 um (ref. 10). For example, LifeAct is a highly specific 
marker of filopodia in eukaryotic cells, but it unexpectedly labels only 
the proximal base of limb mesenchyme cytoplasmic extensions (Fig. 2c) 
and not their distal tips, reflecting a distinguishing feature. This is con- 
sistent with the fact that LifeAct does not label certain forms of highly 
modified actin, or actin that is highly coated with binding proteins". 
Additional specific features are also evident in the actin depolymerization 
factor cofilin, which has mainly been implicated in extending lamellipo- 
dial protrusions’*. Notably, cofilin-eGFP rapidly accumulates to the 
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Figure 1 | Mesenchymal cells of the developing limb bud possess long and 
highly dynamic cytoplasmic extensions. a, Left, HH14 chick embryo 
indicating the site of DNA injection. The red dashed line indicates the cross- 
sectional plane. Right, microinjected DNA in the coelom is shown in green; the 
electrodes position is indicated (LPM). NC, notochord; NT, neural tube. 

b, Diagram of piggyBac transposon system resulting in integration of 
transposon inverted terminal repeat (ITR)-flanked expression cassettes by 
piggyBac transposase. Cre recombinase flanked by loxP sites (black triangles) 
results in mosaic labelling from a /JoxP-containing reporter construct, 
containing membrane-palmitoylated mKate2 (a far-red fluorescent protein) or 
pmeGFP expressed via the tetracycline responsive element (TRE). Dox, 
doxycycline. CAGp denotes a ubiquitous promoter; 3G denotes an inducible 
transactivator protein. c, Confocal z-series acquired in vivo from an HH21 limb 


tips of limb mesenchymal filopodia, and its subsequent retraction 
back to the cell soma prefigures the rapid and dynamic retraction of 
filopodia extensions (Supplementary Fig. 4b and Supplementary Video 6). 
Cofilin—eGFP is also frequently localized to specific microdomains along 
these filopodia that are interrupted rather than labelling the entire pro- 
cess that may account, at least in part, for the greater dynamics of these 
filopodia, including movement and bends in specific sub-regions as they 
traverse extracellular space (Fig. 2d). Fascin, which enhances cofilin 
severing”, also labels filopodial extensions (Supplementary Fig. 4c). 
Together, these findings demonstrate that limb mesenchyme filopodia 
possess unique cytoskeletal features reflecting specialized properties, 
which include their considerable lengths, highly dynamic behaviours, 
and complex geometries. Our initial attempts to perturb mesenchymal 
filopodia formation using known molecular pathways (Methods) have 
proven ineffective, for example, through conditional inactivation of 
cell division cycle 42 (CDC42) in the limb bud (data not shown). 

To determine the functional role of mesenchymal filopodia, we used 
genetic strategies to label specific cellular populations with membrane- 
bound fluorescent proteins along the anterior—posterior axis of the 
mouse limb bud with respect to SHH signalling. This revealed that 
SHH-producing cells within the limb bud ZPA extend long filopodia 
(Supplementary Fig. 6a and Supplementary Video 7), which can orient 
along the anterior—posterior axis as well as the proximal-distal axis, 
with a further bias towards the apical ectodermal ridge that maintains 
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bud reveals an intricate network of cellular extensions (Supplementary Video 1). 
d, Single x-y plane, from c, highlighting the network of long cytoplasmic 
extensions among mesenchymal cells. e, A representative long extension 

(75 um) from c, marked by dotted line. f, Example ofan interaction between two 
cytoplasmic extensions. Cytoplasmic extensions emanating from two cells that 
are initially separated (left panel, yellow and white arrows) then extend until 
they interact and overlap (right panel, yellow and white brackets) to form 
stabilized interactions over a long-range (see also Supplementary Video 4). 
Time in min:s. g, Speed distribution of extending (black) and retracting (grey) 
cytoplasmic extension velocities; n = 8. h, Extension dynamics. Grey bars 
represent net length change in micrometres. Red line represents the mean 
velocity (nm s |). The x axis tick marks denote 1-min intervals. Scale bars, 

10 um (c-e), 3 um (f). 


the SHH and fibroblast growth factor (FGF) feedback loop (Supplemen- 
tary Fig. 7a, b). Moreover, mesenchymal cells within the anterior limb 
bud that respond to SHH also extend similar filopodia (Supplementary 
Fig. 6b); however, they show a bias in orientation along the anterior- 
posterior axis (Supplementary Fig. 7a, c). Thereby, these results demon- 
strate that both SHH-producing and SHH-responding cells extend 
specialized filopodia within the SHH signalling field. 

To determine the possible role of limb mesenchyme filopodia with 
respect to SHH signalling, we visualized key components of the SHH 
signalling pathway. To image SHH ligand, we developed a tightly regulated 
expression system directed by the endogenous G. gallus SHH minimal 
promoter and limb-specific enhancer element (ZPA regulatory sequence 
(ZRS))"*, coupled with doxycycline-inducible control to allow for transient 
gene expression in a small number of cells within the ZPA (Supplemen- 
tary Fig. 8a). SHH is produced as a precursor protein that undergoes 
autocatalytic processing to yield an amino-terminal signalling protein 
(SHHp) containing cholesterol and palmitate moieties’*. SHH fusion 
proteins with monomeric eGFP were constructed, including SHHp- 
eGFP that retains the cholesterol modification, and SHHy-eGFP that 
does not but produces a brighter and more photostable fluorescent 
fusion protein (Supplementary Fig. 8b and Methods). Consistent with 
the fact that SHHy can activate SHH signalling within the limb bud"®, we 
find that after ectopic expression of SHHy-eGFP, patched (PTC1) a 
marker of SHH activation, is expanded anteriorly within the limb bud 
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Figure 2 | Limb mesenchymal cytoplasmic extensions are a class of 
specialized actin-based filopodia. a, UCHD-eGFP-localization staining 
demonstrating that membrane-labelled pmKate2 filopodia extensions contain 
actin filaments. b, Myosin X-eGFP (MYOX-eGEFP) is localized to each 
pmKate2-labelled filopodium and is concentrated at the distal tip. 


(Supplementary Fig. 8e). Notably, expression of these SHH fusion 
proteins under doxycycline-inducible control does not perturb endo- 
genous SHH signalling as revealed by PTC1 expression, nor limb 
development or skeletal patterning (Supplementary Fig. 8d). 


Figure 3 | Live-cell imaging of SHH ligand production and transport 
within the limb bud. a, Schematic of piggyBac-mediated integration of 
transposon-flanked expression cassettes (ITR). The G. gallus SHH minimal 
promoter (SHHp) and ZRS element direct spatial expression in the limb ZPA 
of doxycycline-inducible transactivator protein (3G), which in turn allows for 
the temporal control of SHHy-eGFP or SHHp-eGFP and pmKate2. b, Left, a 
representative SHH-producing cell containing multiple long filopodia, with 
SHHp-eGFP present in discrete particles as well as in a more diffuse form 
localized along these extensions. Right, SHHy-eGFP is produced as a particle 
visualized within the cell soma (arrows) as well as along the filopodia. 

c, Representative timelapse images showing anterograde SHHy-eGFP particle 
movement (arrows) that accumulates at the tip of the certain filopodia but not 
others (indicated by double-headed arrow) (Supplementary Video 8). Time in 
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c, LifeAct-mKate2 marks only the proximal aspect of pmeGFP-labelled 
filopodia and does not label the entire extension, shown by the bracket. 

d, Cofilin-eGFP is present in interrupted domains along the filopodia, negative 
regions shown with brackets. Scale bars, 3 jim (a), 5 jum (b-d). 


Imaging of SHHy-eGFP under native regulatory control in the ZPA 
reveals that it is unexpectedly produced in the form of a particle approxi- 
mately 200 nm in size (Fig. 3a, b). These particles are not observed in the 
extracellular space, but remain associated with the SHH-producing cell 
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min:s, interval is four frames s__'. d, Particle dynamics graph demonstrating the 
movement of SHHy-eGFP particles relative to the filopodium; normalized 
distance to filopodia base is 0, and to filopodia tip is 1. e, Net particle movement 
graph demonstrating the net vectors of particle (n = 38) displacement 
represented as a percentage of the total filopodia length that particles traverse. 
Green denotes anterograde, blue denotes no displacement (<5%), and red 
denotes retrograde displacement. The relative thickness of each vector reflects 
the percentage of particles within each category. There is a statistically 
significant net anterograde movement of SHH particles away from the cell 
soma; P < 0.002. f, SHH-containing filopodia (red lines) are statistically more 
stabilized than filopodia without SHH (black lines); P< 0.001, n = 200 time 
points. All scale bars, 3 tum. 
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via long filopodial extensions. Imaging of SHHp-eGFP also reveals that 
cholesterol-modified SHH is similarly produced as a particle associated 
with filopodial extensions, and also displays more uniform localization 
along these extensions (Fig. 3b and Supplementary Fig. 8c). Importantly, 
such particles are an intrinsic property of the SHH molecule, as they are 
not formed by expression of cytoplasmic, palmitoylated or cholesterol- 
modified eGFP (data not shown). SHH particles only travel to specific 
subsets of filopodia emanating from the same cell, revealing tight 
selectivity and regulation over this process (Fig. 3c). 

High-speed, real-time imaging showed that SHHy-eGFP particles 
move in both anterograde and retrograde directions along filopodia 
(Fig. 3c, d and Supplementary Video 8), with a statistically significant 
net anterograde movement away from the cell body (P< 0.002) 
(Fig. 3e and Supplementary Fig. 9). The maximum velocity of ante- 
rograde particle movement, 120nms_— 1 is consistent with actin-based 
myosin motors’’. Moreover, filopodia containing SHH particles are 
stabilized and less dynamic than non-particle-containing filopodia 
(Fig. 3f; P< 0.001). These findings reveal that filopodia can distribute 
SHH ligand at a distance from the cell body. It remains to be deter- 
mined whether the SHH cholesterol modification, which has been 
proposed to either promote’® or restrict'® the spread of SHH, has 
additional functions in filopodial transport. To our knowledge, this 
is the first in vivo demonstration of SHH ligand production and move- 
ment, revealing an unexpected role for filopodia in this process. 

To determine the precise localization of SHHy-eGFP within filo- 
podia, we used an optimized split GFP complementation system, con- 
sisting of the spGFP,_,) and spGFP,, non-fluorescent GFP fragments 
that reconstitute a fluorescent GFP signal’. Notably, SHHy-spGFP}, 
can physically interact to reconstitute an extracellular leaflet-associated 
glycophosphatidylinositol (GPI)-anchored spGFP}_ 19, producing a GFP 
signal along the extracellular surface of limb mesenchyme filopodia 
in vivo (Supplementary Fig. 10a, d). We also cultured mesenchymal 
cells after electroporation of SHHy-spGFP), and applied a synthesized 
spGFP}_19 peptide exogenously to the media (Methods) that similarly 
produced a GFP fluorescent signal along the filopodial membrane, but 
not when cells do not express SHHy-spGFP,; or express a control 
GFP _;9 cytoplasmic fragment (Supplementary Fig. 10b-d). In addition, 
ectopic expression of SHHy-spGFP), tethered to the membrane as a 
result of its interaction with GPI-spGFP}_ 9 leads to the ectopic expan- 
sion of PTC1 expression (Supplementary Fig. 10e). Although the split 
GFP complementation system displays remarkable antibody affinity 
for GFP fragments”, we cannot exclude the possibility that ectopic 
activation of PTC1 may also derive from a freely diffusible form of 
SHHy-spGFP}, that we cannot detect. Collectively, these experiments 
suggest that SHHy-eGFP is localized to the extracellular leaflet of the 
filopodial membrane, where it is competent to signal and interact with 
SHH receptors. 

We next visualized additional SHH signalling components involved 
in the reception of SHH in vivo. In addition to PTC1, which serves as 
the identified receptor for SHH”', additional co-receptors of SHH, 
including the transmembrane proteins CDO and BOC, are necessary 
for long-range SHH signalling”. Interestingly, in the limb bud these 
co-receptors are only expressed in SHH-responding cells**. In contrast 
to other components of the SHH signalling pathway, such as PTC1- 
yellow fluorescent protein (YFP) and smoothened (SMO)-GFP that 
can localize to primary cilia (Supplementary Fig. 11), our live imaging 
reveals that CDO-GFP and BOC-GFP do not. Instead, they exhibit 
marked localization to discrete microdomains along long filopodial 
(average size 2.4 + 1.5 um; range 0.6-8.5 um) that remain static and 
display little lateral movement (Fig. 4a, b). Moreover, there is substan- 
tial co-localization of these co-receptors to these microdomains in only 
a subset of filopodia emanating from individual SHH-responding 
mesenchymal cells, reflecting tight spatial regulation (Fig. 4c). There 
is a statistically significant stabilization of the filopodia containing 
SHH co-receptors (Fig. 4d; P< 0.001). The molecular roles of CDO 
and BOC in transducing long-range signalling have been poorly 
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understood, although their binding to SHH is independent from 
PTC1 (ref. 25). The unexpected co-localization of these co-receptors 
on microdomains along filopodia suggests that they may participate in 
relaying activation of the pathway at a distance from the cell soma. 

To mark SHH-producing and -responding cells simultaneously and 
precisely, we further designed a hybrid GLI3 enhancer and promoter 
element (Methods and Fig. 4e). Using this dual-expression system, we 
find that filopodia of SHH-producing cells directly interact with filo- 
podia of SHH-responding cells that contain microdomains of BOC- 
eGFP (Fig. 4f). Similar stabilized interactions are evident between 
filopodia containing SHH ligand and SHH-responding cells that have 
undergone pathway activation as revealed by SMO-blue fluorescent 
protein (BFP) localization to cilia (Fig. 4g). Together, these results 
reveal that SHH-producing and -responding cells interact at a distance 
through filopodial membrane-to-membrane contacts containing SHH 
ligand and co-receptors. 
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Figure 4 | Filopodia on SHH-responding cells display an exquisite 
distribution and co-localization of SHH co-receptors that interact with 
SHH-producing filopodia. a, b, Live imaging of CDO-GFP (a) and BOC- 
GFP (b) expression in defined microdomains along the filopodial membrane, 
within subsets of filopodia but not others (arrows). Higher magnification 
images (a, b, right) show multiple positive microdomains of co-receptor 
localization (brackets) interspersed along the filopodia membrane. c, CDO- 
GFP and BOC-mKate2 are co-localized along microdomains of filopodia 
(arrows) labelled with membrane-associated near-infrared fluorescent protein 
(pmiRFP). d, BOC-containing filopodia (red lines) are significantly more 
stabilized than filopodia without BOC (black lines); P< 0.001, n = 160 time 
points. e, Expression system to label SHH-producing (left) and SHH- 
responding (right) cells specifically in the same limb bud. GLI3e/p denotes 
GLI3 enhancer/promoter. f, Representative three-dimensional image of a 
filopodia from a SHH-producing cell (pmKate2, red) that interacts with 
domains of BOC-GFP (green) along the filopodia membrane (pmiRFP, fuscia) 
of an SHH-responding cell. Arrows show interaction along BOC 
microdomains (brackets). g, SHH-producing cell, indicated by pmKate2 and 
marked by bracket, with a long filopodium containing SHHy-eGFP particles 
(arrows) that contacts a smoothened-positive cell (SMO‘; outlined bya dashed 
line). Smoothened-BFP localization to the cilium is a marker of SHH pathway 
activation. All scale bars, 3 um. 
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Our live imaging studies have identified a specialized class of filo- 
podia with distinct cytoskeletal features that localize and transport 
components of the SHH signalling pathway, uncovering an important 
mechanism for the distribution of signalling molecules within tissues. 
The highly stabilized interactions between filopodia containing SHH 
ligand and those containing SHH co-receptors strongly suggest that 
long-range activation of signalling may be mediated through direct 
receptor-ligand interactions between cell membranes at a distance. 
Indeed, SHH-producing and -responding cells can extend filopodia 
as long as 150 um in length encompassing the entirety of the 300-um 
field of SHH signalling’* within the limb bud. Interestingly, mesenchymal 
filopodia may share certain properties with those of cellular extensions 
previously described within invertebrate embryos*”®, and these findings 
are consistent with earlier studies including electron microscopy carried 
out more than 30 years ago describing the presence of fine cellular exten- 
sions on limb bud mesenchyme”. Future studies will be required to 
determine whether such ‘specialized filopodia’ are an inherent feature 
of many, additional cell types that may have escaped previous detection 
in fixed and stained samples. Moreover, an outstanding question is 
whether they rely on unique cellular machinery for their generation, 
composition and ability to transport signalling molecules. For example, 
our findings revealing that SHH is produced in the form of a particle, 
which travels along filopodial extensions in a highly directional manner, 
suggest that a yet unidentified molecular motor may be responsible for 
the movement of SHH along these structures. It is intriguing to specu- 
late that the movement of proteins, and perhaps other molecules such 
as nucleic acids, along specialized filopodial networks offers a new 
mechanism for controlling the precise delivery of molecular informa- 
tion among cells during vertebrate embryonic development, regenera- 
tion and pathological processes such as cancer metastasis. Considering 
the diverse cellular milieus in which signalling molecules have been 
shown to act, specialized filopodia may be a more adapted feature of 
certain signalling centres that operate alone or together with other 
proposed models for ligand distribution, including free diffusion 
through extracellular space. 


METHODS SUMMARY 


A spatially and temporally regulated piggyBac transposition system was developed 
to express fluorescent fusion proteins in a heritable fashion within the developing 
chick embryonic limb bud. The regulated expression of introduced transgenes was 
stable throughout embryogenesis and did not perturb normal limb development. 
Cellular morphology and characterization of specialized filopodia was assessed 
through the use of membrane-associated fluorescent proteins and selected mar- 
kers of the actin and tubulin cytoskeleton. Molecular components of the SHH 
signalling pathway were visualized as fusion proteins under endogenous regula- 
tory control. High-resolution live imaging of the embryonic limb bud was per- 
formed on custom-designed spinning disk confocal systems, allowing for the 
visualization of multiple fluorescently coupled proteins in real time, at high spatial 
resolution. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Plasmid expression constructs. For extended expression during chicken embryo- 
genesis, the piggyBac transposition system was used. The parental plasmid pCAG- 
EBNXN‘* containing the minimal 5’ (314 base pairs (bp)) and 3’ (242 bp) ITRs of 
the piggyBac transposon” was used as the plasmid backbone for expression 
experiments. PBX contains the cytomegalovirus (CMV) enhancer chicken f-actin 
(CAG) promoter expression cassette modified to include the Invitrogen Gateway 
R£A cassette, allowing for phiC31-mediated recombination from Gateway Entry 
vectors. For doxycycline-inducible expression, a modified TRE-3G enhancer min- 
imal promoter element (Clontech) replaced the CAG cassette of PBX to generate 
PBTREX. For the transposase-mediated insertion of the piggyBac transposon 
cassette, the improved piggyBac transposase” was expressed via the CAG promoter 
in the plasmid CAGEN (C. Cepko, Addgene plasmid 11160; ref. 31). Fluorescent 
reporter and signalling constructs were introduced into the Gateway system through 
high-fidelity amplification and insertion into pPENTR/DTOPO (Invitrogen) as an 
intermediate to facilitate cloning. The integrity of all constructs was confirmed by 
DNA sequencing. 

Membrane-associated fluorescent proteins. For multicolour labelling of cells 
and signalling components, the monomeric fluorescent proteins TagBFP (Evrogen), 
monomeric eGFP (K. Svoboda, Addgene plasmid 18696; ref. 32), superfolder GFP 
(sfGFP)**, mKate2 (Evrogen) and oligomeric iRFP (V. Verkhusha, Addgene plasmid 
31857; ref. 34) were selected based on their spectral characteristics as well as their 
suitability for live imaging of fusion proteins. Inner leaflet membrane-associated 
palmitoylated fluorescent proteins were generated by the addition of the 20-amino- 
acid sequence of rat GAP-43 MLCCMRRTKQVEKNDEDQKI to the N terminus of 
each individual fluorescent protein through sequential PCR amplification; these 
constructs are designated pm-XFP. 

Mosaic expression of membrane fluorescent proteins. For Cre-mediated 
recombination experiments, a loxP-SV40 pA stop-loxP cassette (LSL) (D. Stainier, 
Addgene 24334; ref. 35) was introduced into the parental ubiquitous and inducible 
promoter constructs (PB and PB-TRE, respectively). pmKate2 and pmeGFP were 
placed before and after the LSL cassette, respectively, to generate a reporter con- 
struct for Cre activity. To generate a self-inactivating Cre recombinase construct, 
Cre recombinase containing an N-terminal nuclear localization signal was placed 
between the /ox sites in the PB-LSL vector. 

Subcellular markers. To label F-actin, two markers were used that show improved 
cell viability relative to eGFP-actin, UCHD-eGFP (D. Mullins) and LifeAct- 
mKate2. LifeAct-mKate2 was generated by the addition of the 17-amino-acid 
sequence of Abp140 from Saccharomyces cerevisiae (underlined) and linker 
sequence MGVADLIKKFESISKEEGDPPVAT to the N terminus of mKate?2 (refs 
36, 37). Additional markers of the actin cytoskeleton including bovine myosin 
X-heavy meromyosin-eGFP (R. Cheney**’), human moesin-eGFP (S. Shaw, 
Addgene plasmid 20671; ref. 39), human cofilin-eGFP (J. Bamburg) and human 
fascin-eGFP (D. Vignjevic) were used. Markers of the tubulin cytoskeleton 
included TAU-GFP (P. Mombaerts) and EB3-Wasabi (Allele Biosciences). 
Human ARL13B-mKate?2 (J. Reiter) was used as a marker of cilia. Cholesterol- 
modified eGFP was constructed placing amino acids 197-437 of mouse SHH on 
the carboxy-terminal portion of eGFP. The above fluorescent protein constructs 
were amplified by high-fidelity PCR, inserted into the pPENTR/DTOPO vector and 
subsequently cloned into the piggyBac expression construct. 

SHH signalling pathway fusion proteins. Fluorescent fusion proteins were amp- 
lified by high-fidelity PCR or overlap-extension PCR, inserted in the pENTR/ 
DTOPO vector and subsequently cloned into a piggyBac expression construct 
(see above). Mouse Shh cDNA was provided by A. McMahon. SHHp-eGFP was 
generated by placing monomeric eGFP between Ser 196 and Gly 197 of mouse 
SHH through overlap-extension PCR. We optimized the position of monomeric 
GFP coding sequence relative to the SHH cholesterol modification site to improve 
in vivo expression**’. This resulted in an improved GFP signal in vivo; however, 
SHHp-eGFP was less intense than SHHy-eGFP and other fusion proteins, which 
may reflect inefficient processing or stability issues**. SHHy-eGEFP, a fusion pro- 
tein lacking the C-terminal proteolytic domain and resulting cholesterol addition, 
was generated by deleting amino acids 197 to 437 after the eGFP fusion. CDO- 
GEP and BOC-GFP were made as C-terminal fusion proteins’. Mouse BOC- 
mKate2 was generated by replacing the cytoplasmic tail GFP with mKate2. Murine 
SMO-TagBFP was generated replacing the fluorophore of SMO-GFP (J. Reiter). 
PTC1-YFP was from J. Reiter. 

spGFP complementation system for SHH. The optimized split GFP comple- 
mentation system described previously’’ was used to assess whether SHHy-eGFP 
was localized to the extracellular surface of filopodia. SHHy-spGFP ,; was gener- 
ated by placing the M3 peptide after amino acid 196 of mouse SHH separated by a 
flexible linker, (GGGS)X3. SHHy-spGFP,; and GPI-linked CD14-spGFP} 10 
(ref. 45) were inserted into the PB-TRE expression construct. There was no 
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detectable fluorescence observed with either construct independently consistent 
with previous studies’”. 

To address further the subcellular localization of SHH in relation to mesenchymal 
filopodia, we used the exogenous application of a commercially produced spGFP_;o 
(Sandia Laboratories)’. In brief, after the electroporation of the chick embryo 
somatic lateral plate mesoderm, ex vivo cultures of limb mesenchymal cells were 
prepared as described previously**. After plating of these cells, the purified spGFP _10 
fragment was added exogenously to the live culture and imaged. Positive inter- 
action between the SHHy-spGFP); and the spGFP}_ 9 fragments was observed as 
a fluorescent signal 4 h after reagent addition. As a control for possible endocytosis 
of the spGFP)_19 fragment, a spGFP,, luciferase fusion protein localized to the 
cytoplasm did not produce a fluorescent signal. 

G. gallus SHH promoter and enhancer element. The 1.7-kilobase (kb) SHH limb 
specific enhancer element designated the ZRS, and the 1.1-kb chicken SHH minimal 
enhancer element'* were cloned into the PBX vector replacing the CAG promoter 
cassette to generate a Gateway compatible expression construct. Subsequently, 
pmEGFP or the tetracycline activator protein 3G (Clontech) were inserted into 
the expression cassette to allow for either constitutive or doxycycline-inducible 
spatially restricted expression in the ZPA. 

GLI3 intronic enhancer and promoter element. To allow for fluorescent trans- 
genes to be expressed in SHH-responding cells of the limb bud, a screen for promoter 
elements that would correctly regulate expression was performed. In brief, several 
human enhancer elements that were previously identified following chromatin 
immunoprecipitation DNA sequencing (ChIP-seq) of the enhancer associated 
p300 protein and mouse transgenic analysis*’ were subsequently tested in chick 
embryos when coupled to various minimal promoter elements, HSP68, E1b, thy- 
midine kinase and mimimal CMV. Specific expression was achieved when the 
human GLI3 intronic enhancer element, hs1586 (ref. 48), was coupled with the rat 
minimal Gli3 promoter”. Subsequently, pmKate2 (Supplementary Fig. 12) or 3G 
(Clontech) were inserted into the expression cassette to allow for either constitu- 
tive or doxycycline-inducible spatially restricted expression in the GLI3-expressing 
and SHH-responding cells of the limb. 

Chicken embryo manipulation and electroporation. Fertilized chicken eggs 
(G. gallus) were purchased from Petaluma Farms and subsequently stored at 
16 °C. Eggs were incubated in a non-rotary incubator at 38.5 °C until the desired 
stage according to Hamburger and Hamilton (HH). Stage HH13-15 chick 
embryos were windowed following standard techniques in preparation for elec- 
troporation*’. Embryos were visualized with the assistance of a 470/40 nm band- 
width emission filter, which provided the necessary contrast for injection. PBS 
without Ca”*/Mg’* was applied to the embryo. The vitelline membrane above the 
forelimb field was carefully sub-dissected, and additional solution was placed over 
the embryo. DNA constructs, with a final concentration 1-5 pg wt, diluted in 
endotoxin-free HO were combined with phenol red (0.1 mM final concentration) 
to aid in visualization. A 1.0-mm inner diameter capillary glass electrode was 
backfilled with DNA injection solution and a volume of solution was pressure 
injected (WPI Picopump) into the embryonic coelom, to fill completely the 
anterior to posterior extent of the forelimb territory. For the negative electrode, 
a 250-uum diameter platinum rod with a 4-mm length and 2-mm exposed surface 
(Nepagene) was inserted into the yolk and positioned beneath the forelimb field, 
approximately 0.5-1 mm below the embryo. A 250-y1m diameter platinum rod 
with a 1-mm exposed tip served as the positive electrode and was positioned above 
the forelimb field with an approximate distance of 2mm. A square-wave pulse 
train consisting of 8 V, three pulses, 50-ms duration with a 1-s interpulse interval 
was delivered via a Nepa 21 electroporator (Nepagene). This delivery resulted in an 
approximate current of 8-14 mA with energy of 10-18 mJ. Embryos were returned 
to 37.5 °C for the remainder of the incubation period. In experiments using the 
piggyBac transposition system, a 1:5-1:10 molar ratio of piggyBac transposase 
helper plasmid, HypBase ,was combined with the transposon expression construct 
to mediate integration and high-level expression. This ratio resulted in persistent 
expression in the embryonic limb through embryonic day (E)12, 10 days after 
electroporation (Supplementary Fig. la and unpublished observations). Moreover, 
limb development and resulting morphology was normal as assessed with Alcian 
Blue cartilage staining (Supplementary Fig. 1b). For induction of gene expression 
with the inducible 3G system, 12-24 h before imaging, 50 ng doxycycline (Clontech) 
in 500 pl in HBSS was injected beneath the embryonic vasculature. For experi- 
ments with iRFP as a fluorescent protein, 75 ng biliverdin (Frontier Scientific) was 
administered more than 4h before imaging. 

Live imaging of embryos. To facilitate accessibility of the chick embryo to live 
imaging with minimal perturbation, embryos were cultured ex ovo with improved 
viability and sustainability’. In brief, eggs after 48 h of incubation at 38.5 °C were 
prepared in sterile fashion and the embryo was directly transferred to a sterile 
autoclaved 60-mm diameter 35-mm depth crystallization dish by cracking the egg 
and allowing the albumin and yolk to fall gently into the vessel. Five millilitres of 
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sterile PBS without Ca**/Mg** containing 10% penicillin/streptomycin solution 
was added to prevent dehydration. The embryo was subsequently covered with a 
vented sterile lid and placed at 37.5 °C. Electroporation was performed as described 
above. For live imaging on a confocal Axio examiner system (see below), a custom- 
heated stage top incubator (BioOptechs) was designed allowing for the insertion of 
a water dipping objective for continuous imaging while maintaining temperature, 
humidity and normal growth of chick embryos (Supplementary Fig. 1c). 

For imaging with the Zeiss Axio Observer confocal system, mouse embryos or 
electroporated chick embryos were collected into imaging media (DMEM/F12 
with HEPES without phenol red containing 10% heat-inactivated FBS; Invitrogen). 
Extraembryonic membranes were carefully removed and the entire embryo or the 
isolated forelimb was transferred and positioned on a 35-mm glass bottom culture 
dish containing a 14-mm German glass coverslip as its base (MatT ek Corporation). 
A 12-mm coverslip was placed above the embryo secured ona ring of Vaseline that 
served to elevate the coverslip from the embryo, this placement was done to limit 
the movement of the limb bud during imaging. The chamber containing the 
embryo or limb bud was placed in a 37 °C heated microscope incubator (Solent 
Scientific) and imaged as described below. 

Mice. The mT/mG (ref. 53), Shh???" (ref. 54) and GliC=R!* (ref. 55) mice 
were purchased from Jackson laboratories and maintained on C57BL/6] back- 
ground. The mT/mG transgenic line is a double-florescent Cre reporter mouse that 
expresses tandem dimer Tomato (mT) before Cre-mediated excision and membrane- 
targeted enhanced green florescent protein (mG) after excision to mark defined 
populations of limb mesenchymal cells. For induction with tamoxifen, 4mg of 
tamoxifen dissolved in corn oil (Sigma) was delivered via orogastric gavage at E9.5 
and E10.5 for Shh’? : mT/mG/* and Glic’=®"* ; mT/mG/* mice, respect- 
ively. For deletion of Cdc42 experiments”, in which Cdc42 is critically required for 
filopodia formation in vitro’, tamoxifen was delivered on E9.5 for Gli’’8!/*; 
mT/mG/*; Cdc42!°*?/2*P mice. All animals were maintained at The University of 
California, San Francisco, and procedures were performed using Institutional 
Animal Care and Use Committees (IACUC)-approved protocols that adhere to 
the standards of the NIH. 

Confocal microscopy. Images were primarily acquired on one of two custom- 
built spinning disk microscopes. The Zeiss Axio Observer Microscopy system is 
coupled to a Perkin Elmer UltraVIEW Vox spinning disk confocal microscopy 
system. The UltraVIEW Vox system used 405 nm, 488 nm, 561 nm and 640nm 
solid state laser lines paired with emission filters (Semrock) that were specifically 
selected to minimize crosstalk across various fluorescent proteins when excited 
with appropriate solid state sources. Images were captured in the gain mode of a 
back-thinned electron multiplying CCD camera (Hamamatsu ImageEM C91003). 
Acquisition of images was accomplished with the Volocity Acquisition suite for 
multidimensional multichannel time-lapse recordings. Laser power and exposure 
settings were adjusted to minimize photoxicity during the sequence acquisition, 
power output measured at the entrance to the spinning disk of less than 2.5 mW 
for all excitation wavelengths. 

In ovo imaging was performed on a Zeiss Axio Examiner Microscopy System 
with a continuous zoom system coupled to a custom built confocal system, with a 
modified Yokogawa CSU10 scan head that contains an additional patterned array 
of microlenses. A Hamamatsu ImageEM gain CCD camera was used to capture 
multichannel and multidimensional images. A custom-built heated stage top 
incubator allowed for precise insertion of the water-dipping objective for continu- 
ous imaging of chick embryos in a micro-controlled environment (Supplementary 
Fig. 1, see live imaging of embryos). 

Image analysis. Acquired images were processed through the use of the Volocity 
6.0 Visualization and Quantification suite (Perkin Elmer). Image size calibrations 
were performed for each objective and optivar setting with a calibrated graticule 
(Electron Microscopy Sciences). Manual measurements of filopodia, fluorescent 
puncta and trajectories were obtained across the z space of a three-dimensional 
stack as well as in time for continuous imaging. Volocity-recorded values were 
transferred and processed in Microsoft Excel for subsequent quantification and 
presentation. For determining filopodia orientation along limb axes, the relative 
bearing of each filopodial extension across mouse genotypes was categorized into 
four quadrants corresponding to the anterior, distal, posterior and proximal axes. 
This quadrant categorization, encompassing 45° around the centre point, allowed 
for the comparison of vector orientations. Statistical analysis was performed 
assuming homogeneous distribution of variances and applying Student’s t-test 
between percentages of anterior and posterior orientation versus proximal and 
distal orientation per cell. For the measurement of fluorescent intensities values, 
maximal cellular intensity values were obtained with the subsequent subtraction of 


background fluorescent intensity. Filopodia were manually selected and fluorescent 
intensity values were determined using Volocity 6.0 Quantification suite. These 
values were normalized to maximal intensity for the cell to account for differences 
in protein expression levels. Image presentations were generated in the Volocity 
6.0 Visualization suite. For select multicolour confocal acquisitions, deconvolution 
using the calculated point spread function was applied with the Volocity 6.0 
Restoration package. 
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Macropinocytosis of protein is an amino acid supply 
route in Ras-transformed cells 
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Macropinocytosis is a highly conserved endocytic process by which 
extracellular fluid and its contents are internalized into cells through 
large, heterogeneous vesicles known as macropinosomes. Oncogenic 
Ras proteins have been shown to stimulate macropinocytosis but the 
functional contribution of this uptake mechanism to the trans- 
formed phenotype remains unknown'*. Here we show that Ras- 
transformed cells use macropinocytosis to transport extracellular 
protein into the cell. The internalized protein undergoes proteolytic 
degradation, yielding amino acids including glutamine that can 
enter central carbon metabolism. Accordingly, the dependence of 
Ras-transformed cells on free extracellular glutamine for growth can 
be suppressed by the macropinocytic uptake of protein. Consistent 
with macropinocytosis representing an important route of nutrient 
uptake in tumours, its pharmacological inhibition compromises the 
growth of Ras-transformed pancreatic tumour xenografts. These 
results identify macropinocytosis as a mechanism by which cancer 
cells support their unique metabolic needs and point to the possible 
exploitation of this process in the design of anticancer therapies. 

To date, the induction of macropinocytosis by oncogenic Ras has 
been characterized in the setting of overexpressed proteins’. To deter- 
mine whether stimulated macropinocytosis is a feature of cancer cells 
endogenously expressing oncogenic Ras, we analysed fluid-phase 
uptake in human pancreatic and urinary bladder cancer cell lines har- 
bouring oncogenic Ras mutations and compared uptake to wild-type 
Ras-expressing cells originating from carcinomas of the same tissue 
type. Macropinosomes were visualized on the basis of the ability of cells 
to internalize extracellular medium containing tetramethylrhodamine- 
labelled high-molecular-mass dextran (TMR-dextran), an established 
marker of macropinocytosis. Pancreatic adenocarcinoma-derived human 
MIA PaCa-2 cells, which are homozygous for the KRAS®“ allele’, dis- 
played appreciably higher levels of TMR-dextran uptake compared to 
BxPC-3 cells, which express wild-type KRAS° (Fig. la, b). That the 
TMR-dextran labelling in the oncogenic Ras-expressing cells reflects 
uptake through macropinocytosis is indicated by the observation that 
uptake was inhibited in a dose-dependent manner by 5-(N-ethyl-N- 
isopropyl)amiloride (EIPA) (Fig. 1c, d), which has been shown to inhibit 
macropinosome formation without affecting other endocytic pathways®”. 
Importantly, the knockdown of KRAS led to an attenuation of macro- 
pinocytosis, confirming the dependence of this uptake mechanism on 
oncogenic Ras expression (Supplementary Fig. la—d). This conclusion is 
further supported by the observation that bladder carcinoma-derived T24 
cells, which are homozygous for the HRAS®”" allele®, exhibit increased 
levels of macropinocytosis relative to 5637 cells, which express wild-type 
HRAS (Supplementary Fig. 2a-c)’. 

To examine whether oncogenic Ras-expressing cells engage in 
macropinocytosis in vivo, we used both a heterotopic xenograft mouse 


model and an autochthonous mouse model. For the heterotopic 
model, tumours were injected with fluorescein isothiocyanate- 
conjugated dextran (FITC-dextran) and intracellular uptake was 
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Figure 1 | Oncogenic KRAS-expressing pancreatic cancer cells display 
increased levels of macropinocytosis both in culture and in vivo. a, A 
macropinocytosis uptake assay using TMR-dextran as a marker of 
macropinosomes (red) indicates that MIA PaCa-2 cells display increased levels of 
macropinocytosis compared to BxPC-3 cells. DAPI (4’,6-diamidino-2- 
phenylindole) staining (blue) identifies nuclei. b, Quantification of macropinocytic 
uptake in pancreatic cancer cells. Data are expressed as arbitrary units and are 
presented relative to the values obtained for BxPC-3 cells. c, Macropinocytic uptake 
in MIA PaCa-2 cells treated with either vehicle (dimethylsulphoxide (DMSO)) or 
75 UM EIPA. d, Quantification of macropinocytic uptake in MIA PaCa-2 cells 
treated with 0, 25 or 75 [tM EIPA. Data are presented relative to the values obtained 
for the 75 [1M condition. e, Visualization and quantification of macropinocytosis in 
vivo. Representative images from sections of FITC-dextran (green)-injected tumour 
xenografts stained with anti-CK8 (red). Cell boundaries (white outline) were 
delineated on the basis of CK8 staining. Data are presented relative to the values 
obtained for the BxPC-3 tumours. For all graphs, error bars indicate mean + s.e.m. 


for n = 3 independent experiments with at least 300 cells scored per experiment. 
Statistical significance was determined by Student’s t-test; *P < 0.05, **P < 0.01. 
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assessed by fluorescent microscopy of tissue sections. The number of 
macropinosomes identified as FITC-positive puncta was markedly 
higher in tumours derived from MIA PaCa-2 cells relative to BxPC-3- 
derived tumours (Fig. le). To confirm that the macropinosomes are a 
feature of the transplanted cells rather than the host cells, the tumour 
sections were stained with anti-CK8 antibody, which selectively labels 
the transplanted human epithelial cells'® (Fig. le). To analyse macro- 
pinocytosis in mouse pancreatic tumours, we used an autochthonous 
mouse model of pancreatic cancer'’. In this model, animals of the 
genotype P48-cre; Isl-Kras“'*”; Trp53_'* (KPC) develop pancreatic 
intraepithelial neoplasia lesions within 4 weeks of birth and progress to 
invasive pancreatic ductal adenocarcinoma between 9 and 13 weeks’’. 
To assess macropinocytic uptake, KPC mice were injected with FITC- 
dextran at 12 weeks of age and pancreata were subsequently collected. 
FITC-positive macropinosomes were detected in CK19-labelled aci- 
noductal cells within mid- to late-stage pancreatic intraepithelial neo- 
plasia lesions of KPC pancreata but not in pancreata from wild-type 
mice (Supplementary Fig. 3). Altogether, these data indicate that an 
increased level of macropinocytosis is an attribute of cancer cells 
expressing oncogenic Ras both in vitro and in vivo. 

In mammals, approximately 70% of the soluble substances found in 
extracellular fluid can be accounted for by proteins, with serum albu- 
min being the most abundant’’. Therefore, we sought to determine 
whether protein uptake through macropinocytosis can be used by 
oncogenic Ras-expressing cells to meet their metabolic needs for 
proliferation. As an experimental system, we used oncogenic Kras- 
transformed mouse NIH 3T3 cells because of their documented 
dependence on glutamine". The expression of oncogenic Kras’'” in 
these cells was sufficient to stimulate a robust EIPA-sensitive macro- 
pinocytic response as measured by TMR-dextran uptake (Fig. 2a, b). 
To monitor the internalization of albumin, NIH 3T3 Kras” '* cells were 
incubated with a FITC-labelled form of bovine serum albumin (FITC- 
BSA). As shown in Fig. 2c, FITC-BSA was incorporated into discrete 
intracellular structures that co-localized with TMR-dextran. Similar 
uptake was also observed in MIA PaCa-2 and T24 cells (Supplemen- 
tary Fig. 4), indicating that albumin internalization can occur through 
macropinocytosis in cancer cells harbouring endogenous oncogenic 
Ras mutations. Inhibition of FITC-BSA uptake by treatment with 
EIPA confirmed that the uptake mechanism was macropinocytosis 
(Fig. 2d). 

The macropinocytic internalization and subsequent degradation of 
albumin could in principle lead to the generation of amino acids that 
sustain tumour cell bioenergetics and macromolecular synthesis’. To 
determine whether the internalized albumin is intracellularly 
degraded, we used a highly self-quenched BODIPY-dye-conjugated 
form of BSA (DQ-BSA) that emits a bright fluorescent signal only 
after proteolytic digestion’’. Dual labelling of cells with DQ-BSA 
and TMR-dextran was used to establish the macropinocytic origin 
of the degradative compartment. In NIH 3T3 KrasY” cells that were 
immediately fixed following a 30-min incubation with DQ-BSA and 
TMR-dextran (tf = 0), there was no appreciable DQ-BSA fluorescence 
detected in macropinosomes (Fig. 2e). However, in cells that were 
incubated for 30 min and subsequently chased for 1h in media free 
of both DQ-BSA and TMR-dextran, DQ-BSA fluorescence was 
detected in TMR-positive macropinosomes (Fig. 2e). DQ-BSA fluor- 
escence was also detected within macropinosomes after a 1-h chase in 
MIA PaCa-2 and T24 cells, indicating that these trafficking events were 
also occurring in cancer cells harbouring endogenous oncogenic Ras 
mutations (Supplementary Fig. 5). The degradation of macropinocy- 
tosed albumin was dependent on lysosomal hydrolases, as treatment of 
MIA PaCa-2 and NIH3T3Kras””* cells with bafilomycin Al pre- 
vented the degradation of DQ-BSA (Supplementary Fig. 6). These data 
demonstrate that oncogenic Ras-expressing cells can harness macro- 
pinocytosis for the internalization and degradation of extracellular 
albumin, and raise the possibility that plasma protein degradation 
may provide an important source of intracellular amino acids. 
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Glutamine is a key nutrient for many proliferating cells and is 
metabolized to glutamate and then o-ketoglutarate to enter central 
carbon metabolism'’. Ras-transformed cells exhibit an increased 
dependency on glutamine for growth and survival’*"*. To test whether 
the degradation of macropinocytosed albumin results in the produc- 
tion of intracellular glutamine, we directly measured the intracellular 
concentrations of glutamate and «-ketoglutarate in cells grown either 
in the absence or presence of albumin. NIH 3T3 Kras’”” cells were 
cultured for 24h in complete media supplemented with physiological 
concentrations of albumin (2gper 100ml, 2%). As a control, 
NIH 3T3 KrasY ” cells were grown in complete media alone. The addi- 
tion of albumin to the media led to EIPA-sensitive increases in intra- 
cellular concentrations of both glutamate and o-ketoglutarate 
(Supplementary Fig. 7), indicating that macropinocytic uptake of albu- 
min can increase the levels of glutamate and «-ketoglutarate in onco- 
genic Ras-transformed cells. 

NIH 3T3 Kras”” cells use glutamine as a main carbon source for 
tricarboxylic acid (TCA) cycle anaplerosis'’. Therefore, to directly 
trace the fate of protein-derived amino acids in oncogenic Ras- 
transformed cells, we cultured NIH 3T3 Kras’” cells in the presence 
of soluble, heat-inactivated, !*C-labelled yeast protein. Uptake of 
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Figure 2 | Oncogenic Kras-induced macropinocytosis in mouse NIH 3T3 
cells mediates the internalization of extracellular albumin, which is 
subsequently targeted for proteolytic degradation. a, TMR-dextran (red) is 
internalized at higher levels in NIH 3T3 Kras’” cells (Kras’!”) compared to 
untransformed control cells (CTL). b, Quantification of macropinocytic uptake 
in control cells and NIH 3T3 Kras” cells incubated with vehicle (DMSO) or 
with 75 uM EIPA. Data are presented relative to the values obtained for the 
untransformed control cells. Error bars indicate mean + s.e.m. for n = 3 
independent experiments with at least 300 cells scored per experiment. 
Statistical significance was determined by Student’s t-test; **P < 0.01, 

***P < 0.001. c, FITC-BSA (green) is internalized into discrete puncta that co- 
localize (white arrowheads) with TMR-dextran (red). d, FITC-BSA uptake is 
abrogated by treatment with 75 1M EIPA. e, Analysis of DQ-BSA fluorescence 
in NIH 3T3 Kras’ !” cells that were co-incubated with DQ-BSA (green) and 
TMR-dextran (red) and fixed either immediately (tf = 0) or following a 1-h 
chase. The fluorescent signal emanating from DQ-BSA (f= 1h) is an 
indication of albumin degradation. Insets represent a higher magnification of 
the boxed areas. Images shown in c-e are representative of at least three 
independent experiments. 
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yeast-derived protein through macropinocytosis was confirmed 
microscopically by the fluorescent co-localization of rhodamine- 
labelled protein and FITC-dextran (Fig. 3a). After 24h of culture in 
low-glutamine (0.2mM) complete medium supplemented with 
13C_labelled yeast proteins, intracellular metabolites were extracted 
and '°C labelling was quantified using gas chromatography/mass spec- 
trometry (Supplementary Table 1)”°. Substantial labelling of numerous 
intracellular amino acids, including glutamine, was detected (Fig. 3b), 
whereas only low levels of '*C amino acids were detected in media 
samples incubated for the same period of time without cells (Sup- 
plementary Fig. 8). Catabolized yeast protein also entered central 
metabolism, as evidenced by '*C-labelling of pyruvate, lactate and 
various TCA metabolites (Fig. 3c, d). Partially labelled mass isotopo- 
mers in numerous metabolites suggested that protein-derived amino 
acids were potentially being metabolized through several pathways, 
including glutamine anaplerosis/oxidation, acetyl-coenzyme A meta- 
bolism, reductive carboxylation and serine/glycine cycling (Fig. 3d and 
Supplementary Fig. 9). 

Increased sensitivity to glutamine deprivation is a hallmark of can- 
cer cells that express oncogenic mutants of Ras’*”*. Consistent with 
this phenotype, we found that MIA PaCa-2, T24 and NIH 3T3 Kras’ ? 
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cells exhibited decreased proliferation at sub-physiological levels of glu- 
tamine as measured using the MTT viability assay or by Syto 60 staining 
(Supplementary Figs 10 and 11). We next examined whether cell growth 
impairment due to glutamine deprivation could be reversed by culturing 
cells in the presence of physiological levels of albumin. Albumin sup- 
plementation of media containing sub-physiological glutamine concen- 
trations enhanced proliferation, and this effect was abrogated by EIPA 
treatment (Fig. 4a and Supplementary Fig. 11). The anti-proliferative 
response to EIPA observed in these settings was rescued by the addition 
of either extracellular glutamine or o-ketoglutarate (Fig. 4b). The ability 
of extracellular albumin to suppress the detrimental effects of glutamine 
starvation in MIA PaCa-2 cells was dependent on KRAS expression, as 
KRAS knockdown in these cells diminished the rescuing capacity of 
albumin (Supplementary Fig. 12). Together, these data indicate that 
the macropinocytic uptake of albumin could serve to sustain prolifera- 
tion of oncogenic Ras-transformed cells by constituting a source of 
glutamine, and potentially other amino acids. 

To investigate whether the uptake and degradation of albumin 
is a unique feature of oncogenic Ras-induced macropinocytosis, 
we used NIH 3T3 cells expressing Src*°?”F (NIH 3T3 Src*°?”*), a con- 


stitutively active form of Src’. Consistent with previous reports’, 
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Figure 3 | Macropinocytic uptake of extracellular protein drives the 
accumulation of catabolic intermediates and entry of protein-derived 
amino acids into central carbon metabolism. a, Rhodamine (Rh)-labelled 
yeast protein (red) is internalized into puncta (arrowheads) that co-localize 
with FITC-dextran (green). Insets represent a higher magnification of the 
boxed areas. b, Uniformly 13C_labelled intracellular amino acid pools were 
detected in NIH 3T3 Kras”” cells after culture in low-glutamine-containing 
medium (0.2 mM) supplemented with 2% '°C-labelled yeast protein. 

c, Protein-derived alanine enters central carbon metabolism upon 
transamination to pyruvate, and pyruvate can be directly converted to lactate. 


M3 reflects fully labelled alanine, pyruvate and lactate, whereas MO abundances 
reflect metabolites with no ‘°C label. M1 and M2 represent partially labelled 
species that are not present in substantial amounts. d, Atom-transition map 
depicting a model for the entry of amino-acid-derived carbons into the TCA 
cycle and isotopic labelling of various metabolites. Open circles represent 
unlabelled carbon, and different-coloured circles highlight labelling patterns 
that correspond to specific pathways as indicated in key. For all graphs, error 
bars indicate mean = s.d. for three independent experiments. IDH, isocitrate 
dehydrogenase; MID, mass isotopomer distribution. 


30 MAY 2013 | VOL 497 | NATURE | 635 


©2013 Macmillan Publishers Limited. All rights reserved 


LETTER 


NIH 3T3Src’°””" cells displayed increased levels of macropino- 
cytosis relative to untransformed control cells (Supplementary 
Fig. 13a). Moreover, Src-induced macropinosomes displayed the capa- 
city to internalize and degrade extracellular albumin (Supplementary 
Fig. 13b). Importantly, the decrease in proliferation of Src-transformed 
cells following glutamine deprivation (Supplementary Fig. 10) could be 
rescued by extracellular albumin (Supplementary Fig. 13c), suggesting 
that the use of albumin by cancer cells to augment amino acid supply 
can be mediated by inducers of macropinocytosis other than Ras. 

The above observations implicate macropinocytosis in providing 
nutrients to sustain cancer cell proliferation; therefore we sought to 
directly evaluate the role of macropinocytosis in tumour growth. Mice 
bearing MIA PaCa-2-derived heterotopic tumours were treated with 
EIPA or vehicle using an osmotic pump. EIPA administration com- 
menced when tumours attained an average volume of 50-100 mm’. 
After 7 days of treatment, FITC-dextran was delivered intratumorally 
and the tumours were excised. Tumour tissue collected from EIPA- 
treated animals displayed a reduction in macropinocytic uptake of 
FITC-dextran compared to vehicle-only controls (Fig. 4c). Moreover, 
relative to control tumours, those from EIPA-treated animals displayed 
an attenuation in growth and, in some cases regression (Fig. 4d, e). By 
contrast, EIPA administration had no effect on the growth rate of 
tumours derived from BxPC-3 cells, which display low levels of macro- 
pinocytosis (Fig. 4e). These results indicate that a reduction in macro- 
pinocytic capacity may compromise tumour growth. 

The importance of albumin catabolism in human Ras-driven 
tumours depends on the extent to which the tumours require certain 


amino acids in excess of the amounts readily attainable in free form 
from their environment. In human cancers, oncogenic Ras mutations 
are most common in adenocarcinomas of the pancreas”. In preli- 
minary metabolomic experiments, we have measured the abundances 
of 123 known water-soluble metabolites in human pancreatic cancer 
tumour specimens compared to benign adjacent pancreatic tissue. The 
most depleted metabolite in pancreatic tumour tissue in comparison to 
adjacent normal tissue was glutamine, with serine and glycine also 
decreased (data not shown). A common feature of these amino acids 
is their consumption in nucleotide synthesis. These preliminary data 
indicate that human pancreatic tumour tissue is depleted of these free 
amino acids and raise the possibility that in these depleted conditions, 
the acquisition of amino acids through macropinocytosis and protein 
degradation could contribute to tumour growth. In previous experi- 
ments investigating human tumour metabolism, it was found that the 
production of nitrogen waste from some colorectal tumours could be 
in tenfold excess relative to their uptake of free amino acids’’. The 
Ras-induced use of plasma proteins as a source of precursors for 
macromolecular synthesis and anaplerosis could explain both of these 
observations, a possibility that remains to be explored in future studies. 

Recent years have witnessed a renewed appreciation of the altered 
metabolic behaviour of tumour cells and the critical role that such 
metabolic reprogramming has in conferring growth and survival 
advantages to tumour cells. Here we provide evidence that macropi- 
nocytosis-mediated internalization of extracellular protein and its sub- 
sequent intracellular degradation may define a mechanism for amino 
acid supply in Ras-transformed cancer cells. Moreover, these findings 
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Figure 4 | Macropinocytosis is required for albumin-dependent cancer cell 
proliferation in vitro and for tumour growth in vivo. a, The compromised 
proliferation of oncogenic Ras-expressing cells resulting from growth in media 
containing sub-physiological concentrations of glutamine (0.2 mM, 0.2Q) is 
reversed by supplementation with 2% albumin (0.2Q + Alb) and this effect is 
inhibited by treatment with 25 .M EIPA (0.2Q + Alb + EIPA). Total viable cell 
counts were measured using an MTT assay after 6 days of growth. Data are 
presented relative to the values obtained for the 0.2Q condition. b, The effects of 
EIPA treatment (25 1M) are suppressed by increasing the glutamine levels in 
the growth media to the indicated concentrations (that is, 0.5Q indicates 

0.5 mM glutamine) or the addition to the medium of 7 mM dimethyl 
a-ketoglutarate (KG). Data are presented relative to the values obtained for the 
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0.2Q + Alb condition. For both a and b, error bars indicate mean + s.e.m. for 
n= 3 independent experiments. Statistical significance was determined using 
Student’s t-test; *P < 0.05, **P < 0.01. c-e, EIPA inhibits macropinocytosis in 
vivo and reduces tumour growth in a subcutaneous heterotopic xenograft 
model of pancreatic cancer. c, Representative images from sections of FITC- 
dextran (green) injected MIA PaCa-2 tumour xenografts from mice treated 
with EIPA or vehicle-only controls after 7 days of treatment. The human 
pancreatic cancer cells are marked by anti-CK8 staining (red). d, Representative 
digital photographs of dissected tumours from mice treated with EIPA or 
vehicle-only controls. e, Waterfall plots indicating the per cent change in 
tumour volume after 7 days of treatment relative to baseline (day 0 of 
treatment) for tumours derived from MIA PaCa-2 cells or BxPC-3 cells. 
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raise the question of whether the inhibition of macropinocytosis can be 
used for therapeutic targeting in a subset of cancers. 


METHODS SUMMARY 


Macropinosomes were imaged using an Axiovert 200 inverted fluorescent micro- 
scope (Zeiss) and analysed using the ‘Analyze Particles’ feature in ImageJ 
(National Institutes of Health). For the heterotopic xenografts, female homo- 
zygous NCr nude mice (Taconic) were injected subcutaneously, and when 
tumours reached an average volume of 500 mm° FITC-dextran (Invitrogen) was 
injected intratumorally. To evaluate the effects of EIPA on macropinocytosis, 
tumour-bearing mice were treated with EIPA (20 mg ml‘) using an osmotic 
pump (Alzet, Model 1004) at a pump rate of 0.11 ph '. Treatment was initiated 
when tumours attained an average volume of 50-100 mm*. Asa control, we used 
animals treated with vehicle only (DMSO in PBS). For tumour growth assays, 
volumes of the subcutaneous tumours were calculated on the basis of measure- 
ments obtained using digital calipers after 7 days of treatment. For the generation 
of '°C-labelled proteins, a prototrophic haploid SK1 strain of Saccharomyces 
cerevisiae was grown in synthetic complete media lacking amino acids (DIFCO) 
and containing 2% glucose that was either unlabelled, or uniformly ‘C-labelled 
(Cambridge Isotope Labs). Whole-cell extracts were prepared according to ref. 28. 
For intracellular metabolite analysis, cell lysate was collected and after extraction, 
the aqueous phase was collected and evaporated under nitrogen. Gas chromato- 
graphy/mass spectrometry analysis was performed using an Agilent 6890 GC 
coupled to a 5975C MS. Mass isotopomer distributions were determined by 
integrating the appropriate ion fragments and corrected for natural isotope 
abundance. For glutamine-deprivation assays, cells were cultured in the indicated 
glutamine concentration and viable cell counts determined using the MTT assay. 
For some conditions, media was supplemented with BSA (Fraction V, fatty-acid-, 
nuclease- and protease-free (Calbiochem)). For rescue experiments, dimethyl 
a-ketoglutarate (Sigma) was used at 7mM”. EIPA (Invitrogen) was diluted in 
DMSO and used at the indicated concentrations. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Cell culture. All cells were maintained under 5% CO, at 37 °C in medium sup- 
plemented with 10% FBS (Gibco). MIA PaCa-2 and T24 cells were maintained in 
DMEM (Invitrogen); BxPC-3 and 5637 cells were maintained in RPMI (Gibco) 
supplemented with 1mM sodium pyruvate (CellGro) and NIH 3T3 cells were 
maintained in DMEM supplemented with 1x MEM non-essential amino acids 
(Sigma). 

Macropinosome visualization and quantification. Cells were seeded onto glass 
coverslips. Twenty-four to forty-eight hours after cell seeding, cells were serum 
starved for 18h. Macropinosomes were marked using a high-molecular-mass 
TMR-dextran (Invitrogen)-uptake assay wherein TMR-dextran was added to 
serum-free medium at a final concentration of 1mgml' for 30 min at 37°C. 
At the end of the incubation period, cells were rinsed five times in cold PBS and 
immediately fixed in 3.7% formaldehyde. Cells were DAPI-treated to stain nuclei 
and coverslips mounted onto slides using DAKO Mounting Media (DAKO). 
Images were captured using an Axiovert 200 inverted fluorescent microscope 
(Zeiss) and analysed using the ‘Analyze Particles’ feature in ImageJ (National 
Institutes of Health). The total particle area per cell was determined from at least 
five fields that were randomly selected from different regions across the entirety of 
each sample. 

Interference RNA-mediated knockdown of KRAS. KRAS knockdown was 
achieved through the pTRIPZ doxycycline-inducible short hairpin RNA 
(shRNA) expression lentivirus system (Open Biosystems). shRNA sequences for 
scramble shRNA (5'‘-GGAAGTGCAATTATTCTATTA-3’) and KRAS shRNA 1 
(5'-TAGTTGGAGCTTGTGGCGTAG-3’) were cloned into pTRIPZ according to 
the manufacturer’s protocol. The KRAS shRNA 2 construct was purchased from 
Open Biosystems. MIA PaCa-2 cells were transduced with lentiviral particles con- 
taining the indicated pTRIPZ scramble shRNA or KRAS shRNAs and selected with 
21gml' puromycin (Calbiochem) for 3 days. shRNA expression was induced 
with 1 ug ml’ doxycycline and efficient knockdown was confirmed by immuno- 
blotting with KRAS-specific antibodies (sc-30, Santa Cruz Biotechnology). 

Mice. For the heterotopic xenografts, female homozygous NCr nude mice 
(Taconic) were injected subcutaneously in both flanks at 8 weeks of age with 
1 X 10° cells mixed at a 1:1 dilution with BD Matrigel (BD Biosciences) in a total 
volume of 100 yl. When tumours reached an average volume of 500 mm’, 1 mg of 
fixable FITC-dextran (Invitrogen) diluted in PBS to a volume of 100 il was injected 
intratumorally. At 2 h post-injection, tumours were removed and rapidly frozen in 
tissue-freezing medium. Experimental cohorts of at least three mice per pancreatic 
cancer cell line, each mouse implanted in both flanks, were used in triplicate 
experiments. To quantify FITC-dextran uptake in the tumours, the total particle 
area per cell was determined from at least five sections per tumour with five fields 
analysed per section, totalling at least 15 randomly selected fields per tumour. 

For the autochthonous model, pancreata from P48-cre; Isl-Kras?!?; Trp53/* 
(KPC) mice, at 12 weeks of age, were injected with 2 mg of FITC-dextran and 
subsequently collected. As a control, we analysed pancreatic tissue from wild-type 
mice. Frozen sections from sectioned pancreata were analysed using standard 
microscopic techniques and macropinocytosis-positive cells were identified by 
the visualization of FITC-positive puncta. 

To evaluate the effects of EIPA on macropinocytosis in vivo, mice heterotopically 
transplanted with MIA PaCa-2 cells, as described above, were treated with an 
osmotic pump (Alzet) when tumours attained an average volume of 50-100 mm? 
(approximately 2 weeks after transplantation). EIPA (20mg ml ' ina 1:4 dilution 
of DMSO and PBS) was administered at a pump rate of 0.11 ph '. Asa control, we 
used animals treated with vehicle only (DMSO in PBS). Animals were intratumo- 
rally injected with FITC-dextran at various time points (4-7 days) during the course 
of treatment. After a subsequent 1-h time interval, tumours were collected, sec- 
tioned and analysed using standard microscopic techniques. Each experimental 
group consisted of five animals and at least five sections per pancreas were analysed. 

For tumour growth assays, the animals were randomized into control and treated 
groups. Treatment, as described above, was initiated 2 weeks post-transplantation 
when the tumours had attained an average volume of 50-100 mm’. Volumes of the 
subcutaneous tumours were calculated on the basis of measurements obtained in a 
blinded fashion using digital calipers after 7 days of treatment. 

Metabolite quantification. NIH 3T3 Kras’'? cells were grown in DMEM with/ 
without the supplementation of 2% BSA for 24 h before collection. Cells were lysed 


in a buffered 1% Triton solution and lysates were de-proteinized using perchloric 
acid-precipitation (BioVision). Relative metabolite concentrations were then 
determined using colorimetric assay kits: glutamate/glutamine (Bioassay 
Systems) and o-ketoglutarate (BioVision). 

Generation of ‘C-labelled proteins. A prototrophic haploid SK1 strain of 
Saccharomyces cerevisiae was grown to an attenuance of 4.0 in synthetic complete 
media lacking amino acids (DIFCO) and containing 2% glucose that was either 
normal isotopic, or uniformly '*C-labelled (Cambridge Isotopes). Whole-cell 
extracts were prepared according to ref. 28. In brief, cells were collected, re-sus- 
pended in 100 mM HEPES, pH 7.6, 0.8 mM sorbitol, 10 mM magnesium acetate, 
2mM EDTA and 300 mM potassium glutamate and lysed using a Sample Prep 
6870 Freezer/Mill (SPEX CertiPrep). Lysate was treated with DNase (Ambion) 
and RNaseH (Invitrogen) and dialysed against PBS. Endogenous protease and 
glutaminase activity were eliminated by adding the protein to FBS and incubating 
for 30 min at 56°C. The resulting labelled and unlabelled protein was either: (1) 
tagged with N-hydroxysuccinimide-rhodamine (Pierce) according to manufac- 
turer’s instructions for visualization in confocal microscopy experiments, or (2) 
used to supplement cell culture media for subsequent tracing experiments. 
Metabolite extraction. For extracellular metabolite analysis, media was collected 
and metabolites extracted using 0.3 ml ice-cold acetone containing 1 j1g norvaline. 
Extracts clarified by centrifugation and the supernatant evaporated under nitro- 
gen. For intracellular metabolite analysis, cells were rinsed three times with cold 
saline and quenched with cold methanol. One half volume of cold water contain- 
ing 1 jg norvaline was added, cell lysate was collected, and one volume of chlo- 
roform was added to each sample. After extraction the aqueous phase was 
collected and evaporated under nitrogen. 

Gas chromatography/mass spectrometry (GC/MS) analysis. Dried polar 
metabolites were dissolved in 20 ul of 2% methoxyamine hydrochloride in 
pyridine (Thermo) and held at 37 °C for 1.5 h. After dissolution and reaction, tert- 
butyldimethylsilyl derivatization was initiated by adding 30 ul N-methyl-N-(tert- 
butyldimethylsilyl)trifluoroacetamide + 1% _ tert-butyldimethylchlorosilane 
(Regis) and incubating at 37°C for 1h. GC/MS analysis was performed using 
an Agilent 6890 GC equipped with a 30m DB-35MS capillary column connected 
to an Agilent 5975B MS operating under electron impact ionization at 70 eV. One 
microlitre of sample was injected in splitless mode at 270 °C, using helium as the 
carrier gas at a flow rate of 1 ml min '. For measurement of amino acids, the GC 
oven temperature was held at 100°C for 3min and increased to 300°C at 
3.5°Cmin '. The MS source and quadrupole were held at 230°C and 150°C, 
respectively, and the detector was run in scanning mode, recording ion abundance 
in the range of 100-605 m/z. MIDs were determined by integrating the appropriate 
ion fragments” listed in Supplementary Table 1 and corrected for natural isotope 
abundance using in house algorithms adapted from ref. 31. 
Glutamine-deprivation assays. Cells were plated in 96-well plate format at a 
density of 1,000-1,500 cells per well. Thirty-two hours after seeding, cells were 
rinsed briefly in PBS and incubated in the indicated glutamine-deprivation media. 
Glutamine-free DMEM media was supplemented to the indicated concentration 
of glutamine in the presence of 10% dialysed FBS and 25 mM HEPES. For some 
conditions, media was supplemented with BSA (Fraction V, fatty-acid-, nuclease- 
and protease-free, Calbiochem). The nominal 0.2% albumin concentration 
inherently present in complete media was supplemented such that the final con- 
centration of BSA in the media was 2%. For rescue experiments, dimethyl o-keto- 
glutarate (Sigma) was used at 7 mM”. For all experiments, media was replaced 
every 24h. Viable cell counts were obtained using the MTT assay and relative cell 
number was determined by Syto 60 staining. EIPA (Invitrogen) was diluted in 
DMSO and used at the indicated concentrations with vehicle-only controls con- 
sisting of DMSO alone. Images depicting cell number in 96-well format were 
obtained by staining with Syto 60 (Molecular Probes), a red fluorescent nucleic 
acid stain. Plates were then scanned with an Odyssey Imager (LI-COR) at 700 nm. 


30. Antoniewicz, M. R., Kelleher, J. K. & Stephanopoulos, G. Accurate assessment of 
amino acid mass isotopomer distributions for metabolic flux analysis. Anal. 
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Small molecule inhibition of the KRAS-PDEd 
interaction impairs oncogenic KRAS signalling 


Gunther Zimmermann!*, Bjorn Papke?*, Shehab Ismail**, Nachiket Vartak*, Anchal Chandra, Maike Hoffmann‘, 
Stephan A. Hahn*, Gemma Triola’, Alfred Wittinghofer®, Philippe I. H. Bastiaens*? & Herbert Waldmann’? 


The KRAS oncogene product is considered a major target in anti- 
cancer drug discovery’ *. However, direct interference with KRAS 
signalling has not yet led to clinically useful drugs* *. Correct local- 
ization and signalling by farnesylated KRAS is regulated by the 
prenyl-binding protein PDE6, which sustains the spatial organiza- 
tion of KRAS by facilitating its diffusion in the cytoplasm*"’. Here 
we report that interfering with binding of mammalian PDE to 
KRAS by means of small molecules provides a novel opportunity to 
suppress oncogenic RAS signalling by altering its localization to 
endomembranes. Biochemical screening and subsequent struc- 
ture-based hit optimization yielded inhibitors of the KRAS- 
PDE interaction that selectively bind to the prenyl-binding pocket 
of PDE6 with nanomolar affinity, inhibit oncogenic RAS signal- 
ling and suppress in vitro and in vivo proliferation of human 
pancreatic ductal adenocarcinoma cells that are dependent on 
oncogenic KRAS. Our findings may inspire novel drug discovery 
efforts aimed at the development of drugs targeting oncogenic 
RAS. 

The interaction between a biotinylated and farnesylated KRAS4B 
peptide’? with His-tagged PDES was used in an high-throughput 
Alpha Screen (Supplementary Fig. 1) to identify small molecules that 
bind to the farnesyl-binding pocket of PDE. The screen yielded several 
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benzimidazole hits (for example, 1, Fig. 1a) which were further char- 
acterized by means of a fluorescence polarization assay based on a 
known PDES ligand’® (Kp = 165 + 23nM for compound 1), iso- 
thermal titration calorimetry (Kp = 217 + 15nM for 1) and change 
in protein melting temperature upon interaction (see Supplementary 
Figs 2, 3 and 4)"*. 

Crystal structure analysis of 1 in complex with PDES at 1.87A 
resolution (Fig. 1b and Supplementary Fig. 5) and comparison with 
the previously obtained structure of the complex between PDE6 and 
farnesylated RHEB (root mean squared deviation (r.m.s.d.) of 0.9 A), 
revealed that two benzimidazoles bind into the hydrophobic tunnel in 
PDES (Fig. 1b). One molecule is deeply buried and overlaps with the 
farnesyl-binding site. The second molecule is located in the vicinity of 
the binding site that makes main chain contacts with two carboxy- 
terminal RHEB amino acids. Binding of the inhibitors is mediated by 
hydrophobic interactions and hydrogen bonding between the benzimi- 
dazole units and Tyr 149 and Arg61 (Fig. 1b). Consistently, related 
N-benzylated 2-phenylindole shows no binding (Supplementary Fig. 3). 
Furthermore, the side chain of Trp90 underwent a conformational 
change resulting in a T stacking with inhibitor 1 (Supplementary Fig. 6). 

Structure-guided design based on the crystal structure obtained for 1 
in complex with PDE6 suggested covalent linking of the benzimidazoles 


Figure 1 | Structure-based 
development of inhibitors. 

a, Structure and binding affinities of 
benzimidazole compounds 1-6 as 
determined by competitive 
fluorescence polarization assay (see 
Supplementary Information, 
Supplementary Fig. 3). b, Ribbon 
diagram of PDES structure in 
complex with 1 (yellow), and overlay 
with the previously obtained crystal 
structure of farnesylated RHEB 
peptide with PDE6 (cyan). Small 
molecule 1 (green) and farnesyl 
group (red) are shown as ball and 
sticks. Hydrogen bonding 
interactions between two molecules 
of 1 and Tyr 149 and Arg 61 in the 
co-crystal structure are highlighted. 
c, Structure of 2 (orange sticks) in 
complex with PDES. Overlaid is the 
structure of two molecules of 1 (faint 
grey sticks) in complex with PDE8. 
d, Crystal structure of (S)-5 in 
complex with PDE6 confirms the 
presence of a hydrogen bond 
between the piperidine and the 
backbone carbonyl of Cys 56. 
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Figure 2 | In-cell measurements of the effect of 
deltarasin on the interaction of RAS with PDEd 
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and resulting delocalization of KRAS. a, FLIM 
time series on MDCK cells expressing mCitrine- 
RHEB and mCherry-PDE6 show a loss of 
interaction between RHEB and PDE6 after 
treatment with 5 1M deltarasin. Left panel, 
representative sample of FLIM time series. Upper 
two rows show fluorescence intensity distribution 
of the indicated fluorescent fusion proteins, 
whereas the lower two rows show maps of average 
fluorescence lifetime (t,y) in ns and computed 
molar fraction («) of interacting mCitrine-RHEB 
with mCherry—PDE5. Time in minutes is indicated 
above the panels. The deltarasin-induced 
dissociation of mCherry-PDE6 and mCitrine- 
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shows fluorescence intensity distribution of 
mCitrine-RHEB, middle row shows average 
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fluorescence lifetime (T,,) in ns and lower row 

shows molar fraction («) of interacting mCitrine- 
RHEB with mCherry—PDES. The concentration of 
deltarasin is indicated at the top of the panel in nM. 
Right panel, fit of averaged dose-response + s.e.m. 
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c, Time series of mCitrine-KRAS redistribution 
upon application of 5 uM of deltarasin in PANC-1 
(upper panel) and Panc-Tu-I (lower panel) cells. 
Time in minutes is indicated above the panel. The 
first and last time point of each cell line were used 
to quantify the mCitrine-KRAS distribution in 
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and synthesis of a focused library (Fig. 1a) with variation of the linker 
structure (see the Supplementary Information). Covalent linkage via an 
ether bond for instance yielded dimeric compound 2 (Fig. 1a), which 
binds to PDE6 with significantly increased affinity (Kp = 39 + 11nM) 
and with an almost complete overlap with the positions of the indi- 
vidual benzimidazoles (Fig. 1c). 

Investigation of the crystal structure of the complex (Fig. 1c) indicated 
replacement of the allyl group at R for a larger cyclohexyl moiety, which 
led to increased affinity (3, Ky = 16 + 2 nM, Fig. 1a), whereas introduc- 
tion of a negative charge (R = CH,COOH, Kp = 870 + 290 nM, see 
also Supplementary Table 1), omission of a substituent (R= H, 
Kp = 116 + 29nM) or increased steric bulk (R = Boc(4-piperidine), 
Kp > 2000 nM), yielded less potent ligands. 

The proximity of the backbone carbonyl of Cys 56 to substituent 
R (Fig. 1c) suggested introduction of a piperidine to introduce an 
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localization can be seen in the mCitrine-KRAS 
intensity profiles along the white lines in the 
fluorescence micrographs in the right panels. A.U., 
arbitrary units. d, Immunofluorescence staining of 
fixed and permeabilized PDAC cells with a pan 
antibody against RAS (Calbiochem, Anti-Pan-RAS), 
2h after administration of the vehicle DMSO, 

200 nM and 5 uM of deltarasin. e, FRET-FLIM 
measurements of the interaction between mTFP- 
PDE6 and TAMRA-~deltarasin. Upper row, 
fluorescence intensity of mTFP-PDE6; lower row, 
average fluorescence lifetime (t,,) map of mTFP- 
PDE6 alone and in complex with the TAMRA- 
deltarasin. The averaged drop in the fluorescence 
lifetime (<t,,> = s.d.) of mTFP-PDES6 due to FRET 
with TAMRA~deltarasin is presented in the bar 
graph at the right side for N = 3 in each condition. 
Scale bars in all micrographs indicate 10 jim. 
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additional hydrogen bond (Fig. 1c). However, piperidine-containing 
compound 4 (Fig. la) had lower affinity compared to 3, probably 
due to rigidity of the linker that might not allow for three hydrogen- 
bonding interactions. 

Replacement of the phenyl ether by a flexible piperidine 4-carboxylic 
acid ester resulted in an improvement of Kp by almost one order of 
magnitude (5, Kp = 10 + 3 nM, Fig. 1a). Crystal structure analysis of 5 
in complex with PDE6 confirmed the presence of a hydrogen bond 
between the piperidine moiety in 5 and the carbonyl backbone of 
Cys 56 (Fig. 1d). Replacement of the benzyl moiety with a 3-methyl 
thiophene lead to a similarly potent compound (6, Kp = 9 + 2nM, 
Fig. 1a). 

Separation of the enantiomers of Boc-protected 4 and 6 by prepar- 
ative chiral high-performance liquid chromatography (Supplementary 
Fig. 7), removal of the Boc group and biochemical investigation of the 
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Figure 3 | Inhibition of PDE5-KRAS interaction suppresses proliferation 
and MAPK-signalling in oncogenic KRAS-dependent PDAC cells. a, Real- 
time cell analysis (RTCA) of oncogenic KRAS-dependent Panc-Tu-I (upper 
panel) and oncogenic RAS-independent PANC-1 (lower panel) PDAC cell 
proliferation after doxycycline-induced PDES knock-down. Cells were 
transduced with shPDES-572 doxycycline-inducible knockdown vector. Cell 
indices + s.d. were measured in duplicates. Cells were treated with doxycycline 
to induce PDE6 knockdown from the beginning of the experiment (+ Dox). 
b, RTCA of deltarasin dose PDAC cell proliferation response + s.d. of 
oncogenic KRAS-dependent (Panc-Tu-I, upper panel) and KRAS-independent 
(PANC-1, lower panel) cell lines shows deltarasin-induced suppression of 
proliferation in oncogenic KRAS-dependent Panc-Tu-I cells. Deltarasin was 
added at the indicated time point (arrow) and concentration (1-9 uM). The 
inset in each panel shows deltarasin dose versus growth-rate response + s.d. as 
determined from the average of the first derivative of the cell growth curves 


pure enantiomers revealed a four to sixfold difference in potency for 
each pair of enantiomers ((S)-4 Kp =38+16nM vs (R)-4 Kp = 
190 + 55nM), and ((S)-6 Kp = 7 + 3 nM vs (R)-6 Kp = 39 + 18 nM. 
The absolute configuration was assigned by analogy to a previously 
reported synthesis (see the Supplementary Information)'*. Tight 
binding of (S)-4 and (S)-6 to PDE6 was verified by means of a direct 
titration using 5-carboxytetramethyl rhodamine (TAMRA)-labelled 
(S)-4 (Kp =7.6+1.3nM) and TAMRA-~(S)-6 (Kp = 5.3 + 1.5nM, 
Supplementary Fig. 8). TAMRA~(S)-4 and TAMRA-~(S)-6 did not bind 
to the PDEd-homologous GDI-like solubilizing factors HRG4/Unc119a 
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determined between 35 and 65h (Supplementary Fig. 15). c, EGF-induced 
MAPK signalling response in PDAC cells treated with deltarasin. Peak 
normalized Erk1/Erk2 phosphorylation time profiles upon stimulation with 
200 ng ml! EGF in serum starved Panc-Tu-I and PANC-1 cells as quantified 
from three independent western blots for each cell line. Each western blot 
contained the Erk1/Erk2 time response with vehicle control and deltarasin 
(Supplementary Fig. 18). The average +s.d. is shown for each time point. Black, 
vehicle control DMSO; red, 2-h incubation with 5 uM deltarasin before EGF 
administration. d, Deltarasin induces cell death in KRAS-dependent PDAC 
cells (Panc-Tu-I, Capan-1) as measured by an annexin-V/propidium iodide 
FACS analysis (Supplementary Fig. 16). The bar graph shows the average +s.d. 
of three independent experiments for each cell line and condition. Cells were 
analysed by FACS after 24h of vehicle DMSO (black) and 5 uM deltarasin 
incubation (red). 


and Unc119b'*”, nor to the presumed prenyl-binding proteins galectin- 
1 and galectin-3 (refs 18-20), indicating specificity for PDES (Sup- 
plementary Fig. 9). Compound 4 contains a hydrolytically stable ether 
linker (as opposed to an ester in 6) and has appreciable solubility 
and membrane permeability (Supplementary Information), (S)-4 was 
therefore used in the following cell biological studies. We term this 
compound deltarasin. 

Fluorescence lifetime imaging microscopy (FLIM)-based fluor- 
escence resonance energy transfer (FRET) measurements”! were used 
to address whether deltarasin affected the interaction of PDE6 with 
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KRAS in live cells. To this end, mCherry-PDE6 was expressed together 
with mCitrine-tagged farnesylated RAS variants (KRAS6Q mutant or 
the RAS family protein RHEB) that lack part of the polybasic plasma 
membrane interaction motif of KRAS. Thus, their soluble fraction and 
interaction with PDES are strongly enhanced). This largely facilitates 
the detection of the effect of small molecules on the interaction 
between mCitrine-tagged farnesylated RAS and mCherry-PDE® in 
the cytoplasm of live cells by FLIM. The fluorescence patterns of 
mCitrine-RHEB or mCitrine-KRAS6Q and mCherry-PDES in 
Madine-Darby canine kidney (MDCK) cells transiently co-transfected 
with these proteins were homogeneous, showing a clear solubilization 
of mCitrine-RHEB/KRAS6Q by mCherry-PDE%. A substantial drop 
in the mCitrine fluorescence lifetime showed that FRET occurred 
between the fluorescent proteins and corroborated that the solubiliza- 
tion of RHEB/KRAS6Q was due to a direct interaction with PDE6 
(Fig. 2a and Supplementary Fig. 10). Computation of the fraction of 
interacting molecules («) by global analysis of the fluorescence decay 
profiles as obtained by FLIM*'” showed that a significant fraction of 
mCitrine-RHEB/KRAS6Q was in complex with mCherry-PDE5. 
Within a minute, 5 uM of deltarasin completely inhibited this inter- 
action and released the insolubilized mCitrine-RHEB/KRAS6Q to 
the endomembrane system (Fig. 2a and Supplementary Fig. 10). 
This shows that deltarasin interferes with the binding of KRAS to 
PDES in cells and thereby inhibits its solubilization. Quantification 
of the interaction between mCherry-PDE6 and mCitrine-RHEB (or 
mCitrine-KRAS6Q) in live cells by global analysis of FLIM data as 
function of deltarasin dose (Fig. 2b and Supplementary Fig. 11) enabled 
the computation of an ‘in cell’ Kp of 41 = 12nM (27 +7nM for the 
mCitrine-KRAS6Q assay) for the PDEdé-deltarasin interaction, 
remarkably similar to that determined for binding of deltarasin to 
purified PDE6 (38 + 16nM). 

By analogy to PDES knockdown”, treatment of human pancreatic 
ductal adenocarcinoma (PDAC) cell models PANC-1 and Panc-Tu-I 
with 5 uM deltarasin led within 1h to a clear loss of mCitrine-KRAS 
(ectopically expressed) plasma membrane localization and its redis- 
tribution to endomembranes (Fig. 2c). Immunofluorescence staining 
of this type of PDAC cells with an anti-RAS antibody also showed that 
deltarasin induced a random distribution of endogenous RAS to endo- 
membranes (Fig. 2d). Direct binding of deltarasin to PDE6 in live 
PANC-1 cells could also be demonstrated by the occurrence of 
FRET between monomeric teal fluorescent protein-tagged PDE6 
(mTFP-PDES) and TAMRA-tagged deltarasin, as seen by the signifi- 
cant reduction in fluorescence lifetime of mTFP in the presence of 
TAMRA-deltarasin (Fig. 2e). These experiments show that inhibition 
of the PDES-KRAS interaction by binding of deltarasin to PDE6 leads 
to aloss of KRAS spatial organization in PDAC cells, as maintained by 
the solubilizing activity of PDEO’. 

To assess the effect of deltarasin on oncogenic KRAS signalling, the 
growth of PDAC cells that are dependent on oncogenic KRAS for their 
survival (Panc-Tu-I and Capan-1 cells) was compared to PDAC cells 
that are independent of oncogenic KRAS (PANC-1 cells)** or express 
wild-type KRAS (BxPC-3 cells)”. Transfection of these PDAC cells 
with a lentiviral construct for doxycycline-inducible short hairpin 
RNA (shRNA) expression against PDES (Supplementary Figs 12 
and 13) and treatment with doxycycline resulted in strongly reduced 
cell proliferation and cell death of the KRAS-dependent Panc-Tu-I and 
Capan-1 PDAC cell lines after 2-4 days and had only a slight effect on 
the growth of the other cell lines (Fig. 3a and Supplementary Fig. 14a). 
A similar effect on the proliferation of KRAS-dependent PDAC 
cells was found with the PDE6 inhibitor deltarasin (Fig. 3b and 
Supplementary Fig. 14b). The PDAC proliferation profiles as function 
of deltarasin dose (1-9 uM) show that the KRAS-dependent Panc-Tu- 
I cells show strongly reduced proliferation and increased cell death at 
around 5 uM deltarasin, whereas KRAS-independent PANC-1 cells 
only have a small negative dependence of growth rate on deltarasin 
concentration (Fig. 3b, d and Supplementary Figs 15 and 16). The 
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KRAS-dependent Capan-1 cells also showed strongly reduced prolife- 
ration and increased cell death within hours upon treatment with 5 uM 
deltarasin, whereas the wild-type KRAS-harbouring BxPC-3 PDAC 
cells only had a slightly reduced initial growth rate and little or no 
increase in cell death with respect to the control (Fig. 3d and Sup- 
plementary Figs 14 and 16). These experiments indicate that the 
reduced proliferation of KRAS-dependent PDAC cells is caused in 
part by attenuated survival signalling from oncogenic KRAS that 
is delocalized on endomembranes. The deltarasin concentration 
(~3 UM, Fig. 3b) that induced a measurable effect on the proliferation 
of Panc-Tu-I cells was higher than that of complete inhibition of the 
interaction between PDE6 and RAS (~200 nM, Fig. 2b). Because del- 
tarasin was found to be stable to metabolism by PDAC cells 
(Supplementary Fig. 17), this is probably due to the activity of ABC- 
transporters*® that oppose the inward flux of the compound and 
thereby lower its intracellular availability at longer times. 

To address whether KRAS relocalization by deltarasin-mediated 
PDES inhibition has an effect on oncogenic KRAS signal transduction 
by uncoupling it from its effectors at the plasma membrane’, the 
epidermal growth factor (EGF)-induced mitogen-activated protein 
kinase (MAPK) response in serum-starved PDAC cells was studied. 
Quantitative western blot analysis of the phosphorylated Erkl and 
Erk2 MAPKs showed that the KRAS-dependent Panc-Tu-I cells 
had a high basal Erk activity that was reduced upon inhibition of 
PDES by deltarasin (Fig. 3c and Supplementary Fig. 18). Strikingly, 
the two oncogenic KRAS-dependent PDAC cell lines (Panc-Tu-I 
and Capan-1) showed a substantial reduction in the EGF-mediated 
transient MAPK signal response as compared to the other cell lines 
(Fig. 3c and Supplementary Figs 14 and 18). The residual transient 
MAPK response to EGF can probably be attributed to the remaining 
wild-type RAS isoforms at the plasma membrane that are encoded by 
the healthy (wild-type) alleles. Therefore, oncogenic KRAS-dependent 
proliferative and survival signalling is strongly attenuated by the loss of 
KRAS plasma membrane localization as caused by the inhibition of 
PDE6-KRAS interaction by deltarasin. 

To evaluate the anti-tumour activity of deltarasin, we administered 
the compound to nude mice bearing subcutaneous human Panc-Tu-I 
tumour cell xenografts and monitored tumour growth rate. Deltarasin 
was administered by intra-peritoneal (ip.) injection once (QD) or 
twice (BID) per day. Three dosage regiments were tested (10 mgkg ' 
QD, 15mgkg ' QD and 10 mgkg ' BID). An initial ~15% maximal 
weight loss of mice injected with deltarasin could be observed during the 
first 2 days of treatment, after which this stabilized in all groups (Sup- 
plementary Fig. 19). A clear dose-dependent reduction in Panc-Tu-I 
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Figure 4 | Deltarasin impairs dose-dependent in vivo growth of xenografted 
pancreatic carcinoma in nude mice. a, b, Tumour volume measurements 
(a) and tumour volume distribution (b) at day 9 of Panc-Tu-I xenograft 
tumours treated with vehicle or deltarasin at the dosages indicated: deltarasin 
was administered by intra-peritoneal injection once (QD) or twice (BID) per 
day at 10mgkg ‘QD, 15mgkg ' QD and 10mgkg * BID. Changes in mean 
tumour volumes are given relative to the volumes at treatment initiation. Error 
bars represent s.e.m. with n = 9 for the control, 10 mg kg! QD, and 
15mgkg * QD and n= 8 for the 10mgkg ’ BID group. P values were 
obtained by unpaired t-test. **P = 0.01, *P=0.05. 
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tumour growth rate could be observed in deltarasin treated mice with 
respect to the vehicle-injected controls, where the growth of tumours in 
mice that were treated with 10 mg kg ' BID deltarasin was almost com- 
pletely blocked (Fig. 4a). The negative effect of the compound on Panc- 
Tu-I tumour growth could also be observed in the reduced variance in 
tumour size with respect to the control as measured at day 9 (Fig. 4b). 

Our results demonstrate that inhibition of the PDEd-KRAS inter- 
action by means of small molecules affects the spatial organization of 
KRAS and thus provides a novel opportunity to suppress oncogenic 
RAS signalling and thereby tumour growth. 


METHODS SUMMARY 


Screening based on Alpha-technology was conducted in white, non-binding 
1,536-well plates in a final volume of 6 ul (Hiss-PDES, 100 nM, biotinylated 
KRAS peptide 250 nM). 

Kp values were measured by fluorescence polarization measurements. For 
direct titrations, increasing amounts of PDE6 were added to a solution containing 
50-100 nM labelled small molecule in 200 pil PBS buffer. For displacement titra- 
tions, increasing amounts of the small molecules in DMSO were directly added to 
fluorescein-labelled atorvastatin (24 nM) and His,-tagged PDE6 (40 nM) in 200 pl 
PBS-buffer (containing 0.05% CHAPS, 1% DMSO), keeping the concentration of 
fluorescein-labelled atorvastatin, PDE5 and DMSO constant. 

For Kp measurements using isothermal titration calorimetry, PDE6 protein 
(280 1M) was titrated to small molecule (30 1M) in Tris/HCl buffer (temperature 
25°C). In the T,, shift assays, protein melting points were detected by circular 
dichroism spectroscopy in the presence of small molecules. 

Inhibitors were co-crystallized with PDE by mixing a solution of small mole- 
cule and PDE6 with DMSO 1.7% as a final concentration. Crystals were obtained 
from a Qiagen crystallization screen. 

For live-cell microscopy, cells were grown in four-well Lab-Tek chambers 
(NUNC) and transferred to low-bicarbonate DMEM without phenol red supple- 
mented with 25 mM HEPES at pH 7.4. 

The following antibodies were used for western blotting: total Erk (Abcam- 
AB36991; 1:2,000), pErk (Cell Signaling-9101; 1:2,000) and infrared secondary 
antibodies (LI-COR). 

Fluorescence lifetime images were acquired using a confocal laser-scanning 
microscope (FV1000, Olympus) equipped with a time-correlated single-photon 
counting module (LSM Upgrade Kit, Picoquant). Intensity thresholds were 
applied to segment the cells from the background fluorescence. Data were further 
analysed as described in ref. 23 to obtain images of the molar fraction («) of 
interacting mCherry-PDE6/mCitrine-RHEB. 

Real-time cell analyses were carried out by monitoring the proliferation for at 
least 3 days after administration of the inhibitor. 
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Mature HIV-1 capsid structure by cryo-electron 
microscopy and all-atom molecular dynamics 


Gongpu Zhao'*, Juan R. Perilla**, Ernest L. Yufenyuy***, Xin Meng"?, Bo Chen®, Jiying Ning’, Jinwoo Ahn'”, 
Angela M. Gronenborn'”, Klaus Schulten’, Christopher Aiken** & Peijun Zhang'*® 


Retroviral capsid proteins are conserved structurally but assemble 
into different morphologies’. The mature human immunodeficiency 
virus-1 (HIV-1) capsid is best described by a ‘fullerene cone’ model”*, 
in which hexamers of the capsid protein are linked to form a hexa- 
gonal surface lattice that is closed by incorporating 12 capsid-protein 
pentamers. HIV-1 capsid protein contains an amino-terminal 
domain (NTD) comprising seven a-helices and a B-hairpin*’, a 
carboxy-terminal domain (CTD) comprising four a-helices®’, and 
a flexible linker with a 3,9-helix connecting the two structural 
domains®*. Structures of the capsid-protein assembly units have 
been determined by X-ray crystallography*”®; however, structural 
information regarding the assembled capsid and the contacts 
between the assembly units is incomplete. Here we report the cryo- 
electron microscopy structure of a tubular HIV-1 capsid-protein 
assembly at 8 A resolution and the three-dimensional structure of a 
native HIV-1 core by cryo-electron tomography. The structure of the 
tubular assembly shows, at the three-fold interface'', a three-helix 
bundle with critical hydrophobic interactions. Mutagenesis studies 
confirm that hydrophobic residues in the centre of the three-helix 
bundle are crucial for capsid assembly and stability, and for viral 
infectivity. The cryo-electron-microscopy structures enable mod- 
elling by large-scale molecular dynamics simulation, resulting in 
all-atom models for the hexamer-of-hexamer and pentamer-of- 
hexamer elements as well as for the entire capsid. Incorporation of 
pentamers results in closer trimer contacts and induces acute surface 
curvature. The complete atomic HIV-1 capsid model provides a plat- 
form for further studies of capsid function and for targeted phar- 
macological intervention. 

HIV-1 capsid protein (CA) spontaneously assembles in vitro into 
helical tubes and cones that resemble authentic viral capsids’, allowing 
direct structural investigation of the CA assembly using cryo-electron 
microscopy (cryo-EM). Previous studies of helical tubes provided low- 
resolution views of the curved hexagonal lattice, which is stabilized by 
important inter-hexamer CTD-CTD contacts across local two- and 
three-fold axes*"’. However, accurate and reliable atomic modelling of 
the HIV-1 capsid assembly has not been possible, owing to limited 
resolution (up to 16 A). In fact, high-resolution structural determina- 
tion of in vitro assembled samples is very challenging: HIV-1 capsid 
assembly in vitro is inefficient, requires high salt (1 M), and exhibits 
great variability in tube diameters and helical symmetries (Sup- 
plementary Fig. 1a)*''. We devised a rapid back-dilution method to 
improve cryo-EM sample quality and recorded more than 1,500 cryo- 
EM images on film, many of which showed diffraction layer lines to 
10A resolution. Using 27 high-resolution tubes in a single helical 
symmetry (—12, 11), along with a real-space, single-particle, helical 
reconstruction method’*"’, we determined the structure of the CA 
tubular assembly at 8.6 A resolution (Fig. la-c and Supplementary 
Fig. 1b-d). 


The density map clearly delineates all the o-helical motifs in the 
assembled structure (Fig. 1b-d), including the CTD dimerization 
helix, H9, with a distinct kink in the middle (Fig. 1d). The linker region 
(residues 146-150), followed by a 3, 9-helix, shows strong electron 
density (Fig. 1d), suggesting that it adopts a relatively ordered and 
stable conformation. This linker region is a critical determinant for 
proper capsid assembly and stability*. The map also shows detailed 
intermolecular interactions at all four interfaces critical for assembly: 
the inter-hexamer CTD interfaces at the pseudo-two-fold and pseudo- 
three-fold axes, and the hexamer forming NTD-NTD and NID-CTD 
interfaces (Fig. 1c and Supplementary Fig. 1c)?""""*. 

The 8 A resolution density map allowed reliable atomic modelling 
for the entire assembly, using molecular dynamics flexible fitting 
(MDFF)'*. We docked atomic structures (3H47 for CA NTD® and 
2KOD for CA CTD") into the electron density, built the linker and 
missing loops using homology modelling, and then applied MDFF 
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Figure 1 | Cryo-EM reconstruction of HIV-1 CA tubular assembly at 8 A 
resolution and MDFF. a, A cryo-EM image of recombinant A92E CA tubular 
assembly. Scale bar, 100 nm. b, Electron density map of the A92E CA tube with 
(—12, 11) helical symmetry. Yellow arrows indicate pairs of helix H9, located 
between adjacent hexamers. c, MDFF model of the HIV-1 capsid assembly, 
superimposed with the electron density map contoured at 4.00. Three CA 
hexamers, with NTDs (blue) and CTDs (orange), are shown. d, MDFF model of 
a CA monomer viewed from two angles. e, Two CTD dimer structures along -1 
(orange) and 11 (yellow) helical directions, superimposed onto the NMR 
solution dimer structure (grey, 2KOD). 
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(Supplementary Video 1). The resulting atomic model for the tubular 
assembly comprises 71 CA hexamers and 13 million atoms, including 
water molecules and ions. The final model fits the experimental map 
remarkably well, as indicated by the excellent overlap between «-helices 
and their corresponding rod-like densities (Fig. 1c, d), with an overall 
cross-correlation function value of 0.96. All the densities are accounted 
for in the model, including those of the linker region (Fig. 1d) and 
B-hairpin region. An equivalent pseudo-atomic model was obtained 
independently, using a normal mode flexible-fitting method (Sup- 
plementary Fig. 2a). 

The MDFF model of the full CA assembly contains an almost 
invariant hexameric NTD structure with an average root mean 
squared deviation of 1.5 + 1.0A for backbone atoms between all 71 
hexamers. The hexameric arrangement of the NTD is very similar to 
that observed in the crystal structure of cross-linked hexameric CA 
protein’ (Supplementary Fig. 2b). Further, the intermolecular interac- 
tions in a single hexameric unit are the same as those observed in the 
crystal structure (Supplementary Fig. 2c, d). Considering that retro- 
viral CA proteins share a common NTD fold and all form hexamers, 
hexameric CA NTD appears to constitute a building block in the final 
assembly. 

In contrast, the CTD dimers, which connect adjacent assembly units 
on the inner, more curved surface, are more variable than the NTD 
interfaces, with a root mean squared deviation of 3.0A + 0.9 A for all 
71 dimers. Three distinct CTD dimer conformations are seen in the 
assembled helical tube, with dimerization helix H9 crossing angles of 
36°, 44° or 54°. Several conformers for the CTD dimer and full-length 
dimeric CA were previously observed in solution’®, demonstrating the 
plastic nature of the dimer interface. Nonetheless, key contacts are 
formed by the hydrophobic residues, W184 and M185, albeit with 
small variations in their local environment (Fig. le). The CTD girdle 
of the CA hexamer, helices H9 and H10 in particular, seems more 
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Figure 2 | Mutational analysis of the hydrophobic trimer interface. 

a, Detailed view of the trimer interface, with the structural model superimposed 
on the density map (grey mesh, contoured at 3.5a). Selected residues are 
depicted in stick-and-ball representation. b, Virus infectivity (blue) and capsid 
stability (black), as percentage of total CA, + standard deviation (n = 3), of 
wild-type (WT) and CA trimer interface mutants. c, Spontaneous disulphide 
crosslinking of A204C mature and maturation-defective virions analysed by 
immunoblotting for CA. d, In vitro assembly of recombinant wild-type and 
A204C CA proteins, assayed with high-speed sedimentation and 
polyacrylamide gel electrophoresis analysis. Letters u, s and p denote the 
unassembled reaction mixture (u) and the supernatant (s) and pellet (p) after 
assembly. e, Cryo-EM image of A204C in vitro assembly. Scale bar, 100 nm. 
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variable, compared with the CTD seen in the crystal structure 
(Supplementary Fig. 2b, e). Thus, the curvature needed for the helical 
tube assembly is accommodated by variable orientations in the CTD, 
mediated by the intrinsic flexibility of helix H9 in the monomer. 
Important new structural detail is seen at the CTD trimer interface. 
The MDFF model of the tubular assembly shows that this interface 
involves a patch of hydrophobic residues, specifically 1201, L202, A204 
and L205, all situated on one face of the amphipathic CTD helix H10 
(Supplementary Fig. 3a, b). At the centre of the hydrophobic core 
reside residues 1201 and A204 (Fig. 2a and Supplementary Fig. 3b). 
These residues contribute a large portion of the total buried surface 
area (8,758 A”) at the pseudo-three-fold axis. The central hydrophobic 
core is surrounded by polar residues of opposite charges, for example 
K203 and E213 (Figs 2a and 3c), further stabilizing the interface. 
Residues K203 and E213 are highly conserved and were previously 
shown to be critical for optimal HIV-1 capsid stability and for viral 
infectivity'’’’. To test further the involvement of the hydrophobic 
interactions in HIV-1 capsid function, we performed mutagenesis 
and chemical crosslinking studies. Remarkably, hydrophobic replace- 
ments (1201V/L, A204V/L and L205V/I) largely retained the infecti- 
vity and capsid stability (Fig. 2b)'*, but mutant virions with polar 
residue substitutions (1201D, A204D and L205D) were non-infectious 
(Fig. 2b) and contained unstable or abnormal cores (Supplementary 
Fig. 3c-g). Interestingly, these mutations had no apparent effect on the 
morphology of immature virions (Supplementary Fig. 3h-j). Purified 
CA proteins with the same polar substitutions failed to assemble in 
vitro (Supplementary Fig. 3m, n). The A204C CA mutant, which exhi- 
bits a Cys-Cys backbone distance of 5.3 A between adjacent molecules 
at the pseudo-three-fold axis, exhibited disulphide crosslinking of CA 
into dimers, in vitro and in mature virions (Fig. 2c). This was not the 
case for maturation-defective virions in which cleavage of nucleocap- 
sid protein was prohibited (Fig. 2c), implying that the hydrophobic 
CTD trimer interface is specific to the mature capsid’”. In addition, the 
A204C CA protein exhibited higher assembly efficiency (>90%) com- 
pared with wild-type CA (10-15%) (Fig. 2d) and assembled under 
physiological salt concentrations. More intriguingly, the A204C 
protein (and A204L) assembled into closed cones and short tubes that 
closely resembled authentic lentiviral cores (Fig. 2e and Supplemen- 
tary Fig. 3], 0), rather than the long tubes typically formed by wild-type 


Figure 3 | All-atom molecular dynamics simulation of CA tubular 
assembly. a, All-atom tubular assembly model comprising 71 CA hexamers 
(blue NTD and orange CTD) equilibrated for 125 ns. b, Ribbon representation 
of the tubular assembly, highlighting an HOH, circled area in a, superimposed 
on the density map (grey). ¢, Surface representation of the trimer interface 
(circled area in b) in a 425 ns equilibrated HOH model (hydrophobic, polar, 
negative and positive residues are grey, green, red and blue, respectively). 

d, Stereo view of a POH model after 1.5 [1s equilibration. A pentamer (orange) is 
surrounded by five hexamers (black). e, f, Superposition of CTD H9 dimer 
(e) or H10 trimer (f) interfaces from HOH (blue) and POH (orange). 
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CA (Supplementary Fig. 3k). This suggests that CA pentamers are 
readily incorporated in A204C assemblies. Consistent with the in vitro 
results, virions carrying the A204C substitution showed enhanced 
capsid stability and reduced infectivity (Fig. 2b). These results indicate 
that hydrophobic interactions at the CTD trimer interface play a criti- 
cal role in mature capsid assembly and stability. This role is further 
supported by semi-three-dimensional Monte Carlo simulations of 36 
CA dimers, on the basis of the assembly phase diagram established 
previously (Supplementary Fig. 4)°. 

A ‘fullerene cone’ model of the HIV-1 capsid requires insertion of 12 
CA pentamers into the hexagonal surface lattice to close the ovoid. 
Although the atomic structures of pentameric and hexameric HIV-1 
CA mutants have been determined””, and their structural similarity 
suggests that they are quasi-equivalent’®, detailed contacts between 
pentamers and hexamers for building a closed fullerene cone are not 
known. On the basis of the hexamer-of-hexamers (HOH) motif (six 
hexamers surrounding a central hexamer) extracted from the helical 
tube assembly (Fig. 3a, b), we generated a model for a pentamer of 
hexamers (POH; five hexamers encompassing a central pentamer), by 
replacing the central hexamer with a pentamer. The resulting model 
contains gaps between the surrounding hexamers because of the 
deletion of one hexamer, and poor dimer and trimer interactions 
(Supplementary Video 2). However, during a molecular dynamics 
simulation (Supplementary Table 1), these gaps closed quickly (Sup- 
plementary Video 2), converting the relatively flat starting model into 
a highly curved dome-like structure (Fig. 3d, Supplementary Fig. 5a, b 
and Supplementary Video 3). More specifically, sharp bite angles 
between adjacent subunits were formed, namely 137° and 147° for 
pentamer-hexamer and hexamer-hexamer edges, respectively, com- 
pared with 167° and 147° in the case of HOH (Supplementary Fig. 5b). 
The sharp bite angle closely matches the 135° value seen in the full- 
erene cone geometric model”’. All pre-existing intra-hexamer (or pen- 
tamer) and CTD dimer intermolecular interfaces were preserved in the 
simulation, whereas new dimer and trimer interactions between hex- 
amers and between pentamer and hexamers were formed in 300 ns 
(Fig. 3d and Supplementary Video 2), validating the previously sug- 
gested quasi-equivalence in the capsid lattice’. Although the CTD 
dimer interface remained slightly variable, as in the HOH model 
(Fig. 3e), the three-helix bundle at the CTD trimer interface was con- 
served but more closely packed in the POH (Fig. 3fand Supplementary 
Fig. 5c, orange) compared with the HOH (Fig. 3f, blue). This is intri- 
guing and suggests that enhancing the interaction at the three-fold 
axis, for example by A204C crosslinking, facilitates incorporation of 
pentamers into the assembly (Fig. 2e). Electrostatic destabilization at 
the six-fold axis in CA NTD’”, counterbalanced by the stabilizing 
hydrophobic interactions at the three-fold axis, probably controls pen- 
tamer incorporation. Given that the trimer contact is absent in the 
planar two-dimensional crystal of the CA assembly”, but clearly pre- 
sent in the helical tubular assembly, and even tighter in the highly 
curved POH model, we suggest that the three-fold interface is essential 
for the curved asymmetric assembly of the conical capsid. 

The HOH and POH assembly structures allow construction of an 
improved geometric HIV-1 capsid model”. Building a realistic atomic 
model of the HIV-1 capsid, however, requires an experimentally deter- 
mined native HIV-1 core structure. Because of the non-uniform, 
asymmetric nature of the native cores, we performed structural ana- 
lysis by cryo-electron tomography (cryo-ET) of HIV-1 cores isolated 
from virions (Supplementary Fig. 6 and Supplementary Video 4). The 
tomographic slices of a best-quality HIV-1 core, with a cone angle 
of 23° and representing the most common core™* (Supplementary 
Fig. 6b), clearly exhibit arrays of CA hexamers (Fig. 4a, red arrows) 
and declinations made by CA pentamers (Fig. 4b, yellow stars) at the 
outer capsid density layer. The native core also includes internal densi- 
ties corresponding to the viral RNA genome and proteins enclosed 
within the capsid (Fig. 4a, b). At the current tomography resolution, 
individual hexamers or pentamers in the capsid cannot be discerned. 
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Figure 4 | All-atom HIV-1 capsid model. a, b, Cryo-ET analysis of an 
isolated, native HIV-1 core, shown as two representative slices through the 
three-dimensional volume. Red arrows indicate arrays of CA hexamers; yellow 
stars indicate locations of sharp curvature change. Scale bar, 20 nm. c, A 
fullerene cone model (216H + 12P, green inset) matches the shape and size of 
the capsid, shown by the overlay of densities from the segmented capsid 
(orange contour) and the fullerene model (green). HIV-1 core internal densities 
are shown in light grey. d, Stereo view of the final molecular dynamics 
equilibrated all-atom capsid model (model I, see text) comprising 216 CA 
hexamers (blue, NTD; orange, CTD) and 12 CA pentamers (green). 


Thus, guided by the shape, size and structural features of the capsid 
layer, extensive fullerene models, including classes containing 252, 
216, 186 or 166 hexamers, were evaluated and top candidates from 
each class (119 of a total of 724) were tested by cross-correlation 
between the model and the capsid density (Supplementary Fig. 7). 
Two fullerene models, I and II, comprising 12 pentamers, as well as 
216 and 186 hexamers, respectively, were selected to generate all-atom 
molecular dynamics models of the entire HIV-1 capsid, comprising 
1,356 (model I, Fig. 4c, d) or 1,176 (model II, Supplementary Fig. 7b) 
CA subunits. These models were confirmed and analysed through a 
fully solvated, unconstrained 100ns, 64 million atom, molecular 
dynamics simulation as described in Supplementary Material (Sup- 
plementary Table 1). 

The resulting capsid model I contains a variable CTD structure, with 
an average root mean squared deviation of 3.6 + 0.5 A for backbone 
atoms between all 216 hexamers, compared with 2.3 + 0.4A for the 
NTD. Analysis of the bite angle between the neighbouring hexamers (or 
pentamers) shows a bimodal distribution (Supplementary Fig. 8a), with 
a minor population of sharp bite angles (mean = 139°) located around 
pentamers (Supplementary Fig. 8b), in addition to most continuously 
varying bite angles (144-180°). Compared with the recent cryo-EM 
structure of an immature intermediate of a different retrovirus”, our 
mature capsid structure shows a very different set of inter-subunit 
interactions, suggesting large conformational changes accompany ret- 
rovirus maturation. Our atomic models of the complete HIV-1 capsid 
further highlight the three-fold CA CTD trimer interface as an attrac- 
tive therapeutic target and provide a platform for future studies of 
capsid function and for targeted pharmacological intervention. 


METHODS SUMMARY 


CA tubes were assembled at 2 mg ml ' ina buffer containing 1 M NaCl and 50 mM 
Tris-HCl, pH 8.0. Cryo-EM micrographs were collected on Kodak SO- 163 films, on 
an FEI Polara microscope operated at 200 kV, and digitized at a pixel size of 1.09 A. 
Iterative Helical Real Space Reconstruction’’ was used to estimate the helical para- 
meters. The final refinement was performed using a previously established 
method". Cryo-ET tilt series of native HIV-1 cores were collected on a Gatan 
4k X 4k CCD (charge-coupled device) camera by tilting the specimen from —70° 
to 66°, with a total dose of approximately 120e /A? and a defocus value around 
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8um. Three-dimensional tomograms were reconstructed using IMOD and 
TOMO3D software**’’. MDFF'* was applied for 10 ns using NAMD2.9 (ref. 28) 
at 2 fs time-steps and with helical symmetry restraints. The MDFF-derived HOH 
structure was further equilibrated for 425 ns. On the basis of the equilibrated HOH 
model and a published pentamer structure 3P05 (ref. 10), a POH model was built 
and equilibrated for 1.5 1s. A total of 71 MDFF-derived hexamers were docked into 
the cryo-EM map and equilibrated for 125 ns. Simulations of the complete HIV-1 
capsid (64 million atoms, 100 ns) were performed with NAMD. Virus particles were 
derived by transfection of the full-length HIV-1 proviral construct R9 and mutant 
derivatives into 293T cells, and infectivity was quantified by titration on HeLa-P4 
cells. Capsid stability was determined by measuring the amount of CA protein 
present after purification of HIV-1 cores, and the rate of uncoating was analysed 
by quantifying CA release from purified cores at 37 °C. For crosslinking analysis, 
particles were pelleted from the supernatants of transfected 293T cells, analysed by 
non-reducing SDS-PAGE, and CA was detected by immunoblotting. 
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Crystal structure of a nitrate/nitrite exchanger 


Hongjin Zheng’, Goragot Wisedchaisri! & Tamir Gonen! 


Mineral nitrogen in nature is often found in the form of nitrate 
(NO; _). Numerous microorganisms evolved to assimilate nitrate 
and use it as a major source of mineral nitrogen uptake’. Nitrate, 
which is central in nitrogen metabolism, is first reduced to nitrite 
(NO, _) through a two-electron reduction reaction”*. The accumula- 
tion of cellular nitrite can be harmful because nitrite can be reduced 
to the cytotoxic nitric oxide. Instead, nitrite is rapidly removed from 
the cell by channels and transporters, or reduced to ammonium or 
dinitrogen through the action of assimilatory enzymes’. Despite 
decades of effort no structure is currently available for any nitrate 
transport protein and the mechanism by which nitrate is transported 
remains largely unknown. Here we report the structure of a bacterial 
nitrate/nitrite transport protein, NarK, from Escherichia coli, with 
and without substrate. The structures reveal a positively charged 
substrate-translocation pathway lacking protonatable residues, suggest- 
ing that NarK functions as a nitrate/nitrite exchanger and that 
protons are unlikely to be co-transported. Conserved arginine residues 
comprise the substrate-binding pocket, which is formed by asso- 
ciation of helices from the two halves of NarK. Key residues that are 
important for substrate recognition and transport are identified 
and related to extensive mutagenesis and functional studies. We 
propose that NarK exchanges nitrate for nitrite by a rocker switch 
mechanism facilitated by inter-domain hydrogen bond networks. 
The nitrate/nitrite porter (NNP) family of membrane proteins evolved 
to efficiently translocate the ionic molecules NO; and NO, across 
the membrane*”. Two nitrate/nitrite transport proteins NarK and 


Superposition 


Figure 1 | The crystal structure of NarK. a, Part of TM2 of NarK is shown 
in stereo view with a sigmA-weighted 2F, — F. map at 2.6 A resolution, 
contoured at 1.0. b. Left, NarK structure viewed from the plane of membrane 
with the putative location of the lipid bilayer as indicated. NarK is coloured 
in rainbow with the N terminus in blue. Right, NarK viewed from the 


NarU were identified in Escherichia coli*"'°. NarK proteins have been 
shown to catalyse either nitrate/nitrite exchange or nitrate uptake, pre- 
sumably by symport with a proton’””°. The former activity would be 
associated with respiration, whereas the latter could be associated either 
with respiration or the net assimilation of nitrite into cell material. (An 
additional membrane protein, NirC, functions as a H” /nitrite channel 
in E. coli but is not a member of the NNP"®.) Nitrate is the preferred 
source of nitrogen for plants and at least 16 nitrate/nitrite transport 
proteins have been identified". In plants the function of NNP proteins 
is probably related solely to the net assimilation of nitrogen. 

The NNP family belongs to the major facilitator superfamily (MFS) of 
secondary transporters. MFS members show specificity to a wide range 
of molecules’*. Although more than 58 distinct families of transporters 
make up the MFS, representatives of only six such families have been 
crystallized and their structure determined'*’. These six representative 
transporters require protons for their function. All MFS members are 
postulated to function through the rocker switch mechanism’. They 
all share a common structural topology, but share little or no sequence 
homology. 

Here we report the first crystal structure ofa nitrate transport protein 
that also transports nitrite. The structure of the E. coli NarK with and 
without substrate was determined by X-ray crystallography. Functionally 
important residues that form the substrate-binding pocket are identified 
and related to previously described mutagenesis and functional studies. 
We provide the first evidence that NarK functions as a nitrate/nitrite ex- 
changer, and that protons are probably not co-transported with the substrate. 


Periplasm 
5 —— 


periplasmic side. The identity of the 12 transmembrane helices is indicated. 
c, The N-terminal domain (TM1-TM6) of NarK (blue) is pseudo-symmetric to 
the C-terminal domain (TM7-TM12) (yellow) and can be superimposed with 
an r.m.s.d. of 2.9 A. d, Cut-away surface representation of the inward-facing 
NarK shows the central cavity exposed to the cytosol. 


1Janelia Farm Research Campus, Howard Hughes Medical Institute, 19700 Helix Drive, Ashburn, Virginia 20147, USA. 
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Figure 2 | The substrate-binding site in NarK. a, Two highly conserved 
nitrate signature motifs in TM5 and TM11 (blue helices) at the centre of 
NarK form the nitrate/nitrite transport pathway. b, The substrate-binding 
pocket is defined by two evolutionarily conserved and functionally important 
arginine residues R89 and R305. The binding site is capped above and below by 
F267 and F147, respectively. c, R89 and R305 are stabilized by inter-domain 
hydrogen bonds as depicted. The two halves of NarK are indicated as blue 


We overexpressed NarK, and purified the protein to homogeneity as 
described in the Methods section. Well-ordered high-quality crystals 
were obtained when NarK was co-crystallized with the Fab fragment of 
a monoclonal antibody we developed. The data extended to 2.6 A were 
phased using molecular replacement with Fab as a search model. The 
resulting density map was of high enough quality, allowing us to build 
and refine the NarK structure (Fig. 1a). The asymmetric unit contains 
one molecule of NarK forming a complex with one molecule of Fab 
(Supplementary Fig. 1). NarK, as other MFS proteins, is structurally 
divided into two domains, the amino-terminal half and the carboxy- 
terminal half each consisting of six transmembrane helices (TM1- 
TM6 and TM7-TM12, respectively) (Fig. 1b, c). The two domains 
are connected by a long loop between TM6 and TM7 (disordered in 
our structure), and it is thought that the substrate transport pathway is 
localized at the interface between these two domains. NarK appears to 
be in the inward-facing conformation as the hydrophilic central cavity 
is exposed to the cytosolic side (Fig. 1d). A detailed description of the 
crystal packing and the overall architecture of NarK can be found in the 
Supplementary Information. 

All members of the NNP family contain two stretches of conserved 
residues called the nitrate signature (NS) motifs (Supplementary 
Fig. 2). The nitrate signature motifs are not found in other MFS mem- 
bers, but are a unique feature of the NNP family”’. In NarK, the NS1 
motif is formed by residues 164-175 (GGALGLNGGLGN) located on 
TM5. The NS2 motif of NarK is formed by residues 408-420 
(GFISAIGAIGGFF) located on TM11 (Fig. 2a, blue). The nitrate sig- 
nature motifs in NarK are located at the centre of the protein, lining 
part of the substrate transport pathway (Fig. 2a, right). Both of the 
nitrate signature motifs are glycine-rich, which ensures a tight fit 
among the surrounding helices. The result is a significantly more 
compact structure for NarK when compared to other known struc- 
tures of MFS members (Supplementary Fig. 3). 

To transport anions like NO; and NO, , polar residues lining the 
central pore are most likely to form the substrate-binding pocket and be 
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(N-terminal domain) and yellow (C-terminal domain). d, Nitrite-bound 
structure of NarK. The density for nitrite is observed at the substrate-binding 
site after soaking the NarK crystals with sodium nitrite. R305 undergoes a 
conformational change upon substrate binding. This conformational change 
affects the inter-domain network of hydrogen bonds between Y263, G171 and 
R305. The displayed map is a sigmaA-weighted F, — F, at 30. 


involved in substrate recognition and transport. Two arginine residues, 
R89 from TM2 and R305 from TM8, are absolutely conserved among 
all nitrate/nitrite transporters in both prokaryotes and eukaryotes 
(Supplementary Fig. 2). Structurally, R89 and R305 are in plane and 
appose one another at the very centre of NarK, with their side chains 
extending well into the central cavity of the transporter (Fig. 2b). These 
arginines are capped by two phenylalanine residues: F267 above and 
F147 below. Together the arginines and the phenylalanines form the 
substrate-binding pocket. The only bulky side chain in plane with the 
arginines is Y263, which forms a hydrogen bond with R305 (Fig. 2c). 
The two arginine side chains are stabilized by an intricate system 
of inter-domain hydrogen bonds that link the two halves of NarK 
(Fig. 2c). Mutation of the residues described above led to a complete 
loss of function in NarK and its close homologues (Supplementary Fig. 2 


Table 1 | Mutagenesis and functional study of key residues import- 
ant for nitrate/nitrite exchange 


Nark NarU NrtA Colony growth on nitrate 
(E. coli) (E. coli) (Aspergillus nidulans) 

R89 R87K* R87K+ ae 
R87P* R87T+ = 
N175 168At = 
N168Q+ = 
N168Ct+ = 
Y263 Y261Q* a 
Y261N* se 
R305 R303Q* R368K+ + 
R303C* R368Q+ = 
R368Ct+ _ 
A415 N459At Ss 
N4590+ te 
N459Kt+ = 
N459C+ = 


*Study published in ref. 10. 
+Study published in ref. 24. 
{Study published in ref. 21. 
See references above for a comprehensive list of all available mutagenesis data. 
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Figure 3 | Protons are probably excluded from the substrate translocation 
pathway of NarK. a, Location of histidine, aspartate and glutamate residues in 
the anion transporter GlpT. Acidic residues line the substrate translocation 
pathway. b, Electrostatic surface representation for each domain of GlpT 
showing a relatively even distribution of positive and negative charges in the 
substrate translocation pathway. c, Location of histidine, aspartate and 
glutamate residues in the fucose transporter FucP. Acidic residues line the 


and Table 1). A detailed discussion of the relevant mutational and 
functional studies can be found in Supplementary Information. 
Soaking our NarK crystals with sodium nitrate deteriorated the crystal 
packing and did not yield meaningful data. In sharp contrast, soaking 
the crystals with sodium nitrite did not significantly affect crystallinity 
and yielded data to 2.8 A resolution (Supplementary Table 1), allowing 
us to visualize the nitrite bound in the substrate-binding pocket (Fig. 2d 
and Supplementary Fig. 4). Overall the structure of substrate-free 
NarK and nitrite-bound NarK are very similar, having an all Co-atom 
root mean squared deviation (r.m.s.d.) of 0.6 A (Supplementary Fig. 5). 
This is not surprising because NarK is probably stabilized in the inward- 
facing conformation by crystal contacts and the Fab could further 
restrict protein movement (Supplementary Fig. 1). Nevertheless, clear 
densities were observed for nitrite in the substrate-binding pocket 
(Fig. 2d and Supplementary Fig. 4). Nitrite was observed in-plane with 
R89 and R305 at the substrate-binding pocket where it is capped above 
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substrate translocation pathway. d, Electrostatic surface representation for each 
domain of FucP. e, Location of histidine, aspartate and glutamate residues in 
NarK. No acidic residues are found in the substrate translocation pathway of 
NarK. f, Electrostatic surface representation for NarK showing a dominantly 
positively charged substrate translocation pathway. It represents a formidable 
barrier for the translocation of protons but could attract negatively charged 
molecules like nitrate and nitrite. 


and below by F267 and F147, respectively (Fig. 2d). This binding 
configuration stabilizes the substrate via the m-electron delocalization 
among the arginine and phenylalanine side chains. Arginine R305 
changed its conformation so that the inter-domain hydrogen bond 
network involving Y263 and G171 was disrupted upon nitrite binding. 

Although it is clear that NarK and other NNP members are capable 
of NO; uptake and NO, export, it is not clear what the mechanism 
is or whether the process is proton coupled. Three distinct modes of 
action have been proposed: H*/NO3_ symport, H*/NO,_ antiport, 
ora NO; /NO,~ exchange without H* translocation!?”. 

Typically, channels and transporters that translocate protons use residues 
that are capable of protonation or deprotonation along the pore or 
substrate pathway’’. For example, the lactose permease LacY, which 
co-transports lactose with protons, uses glutamate and histidine resi- 
dues to translocate the protons". The glycerol-3-phosphate/phosphate 
antiporter GlpT uses a protonated histidine to facilitate substrate 
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Figure 4 | Proposed mechanism of nitrate/nitrite exchange. a-f, Six 
conformations of NarK are depicted as outward facing (a), outward facing with 
nitrate bound (b), occluded with nitrate bound (c), inward facing with nitrate 
release (d), inward facing with nitrite bound (e), occluded with nitrite bound 
(f). Once NarK completes the cycle and returns to the outward facing 


binding"* (Fig. 3a, b). The fucose transporter FucP uses glutamates and 
aspartates to translocate protons together with substrate’® (Fig. 3c, d). 
NirC, which is a proton-coupled nitrite channel has a functionally 
important histidine residue at the centre of the channel”. In these four 
examples, glutamates, aspartates and/or histidines are found on trans- 
membrane helices with their side chains extending into the substrate 
translocation pathway. 

NarK does not contain glutamates, aspartates or histidines on its trans- 
membrane helices. Instead, all glutamates and aspartates are found on 
either cytoplasmic or periplasmic soluble domains of NarK, well away 
from the substrate translocation pathway (Fig. 3e, balls and sticks). Moreover, 
NarK only contains three histidine residues and these are also found on 
soluble loops. Therefore NarK has no candidate residue for proton 
translocation or deprotonation in its substrate translocation pathway. 

Consistent with the above postulate, surface electrostatic potential 
calculations in NarK indicate that the substrate translocation pathway 
is highly positively charged. The electrostatic potentials for the N- and 
C-terminal halves of NarK are presented in Fig. 3f. The positively 
charged pathway can facilitate the transport of the negatively charged 
nitrate and nitrite anions, but at the same time it would represent a 
formidable barrier for the translocation of protons. In sharp contrast, 
MEFS members that couple the movement of substrate to the move- 
ment of protons, have a much more balanced electrostatic distribution 
in their translocation pathway (Fig. 3). We note that it is possible that 
protons could be required for NarK activation, but our data indicate 
that NarK is a nitrate/nitrite exchanger in which protons are not co- 
transported with the substrate. Moreover, it is still not clear whether 
NarK could function in nitrite transport alone and if so, whether this 
could be bidirectional. Further functional studies would have to be 
performed to answer these questions. 
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conformation nitrite is released to the periplasm. The proposed mechanism is 
based on the rocker switch’’. For NarK the rocker switch is facilitated by 
breaking and reforming inter-domain hydrogen bonds involving R89 and R305 
as described in Fig. 2. 


Our data show that NarK functions as a nitrate/nitrite exchanger. 
We propose the following mechanism for nitrate/nitrite exchange. The 
mechanism is based on the structural analysis that is presented above, 
and the previously proposed rocker switch’ (Fig. 4). Our proposed 
mechanism of action begins with NarK in the outward conformation 
in which the substrate translocation pathway is open to the periplasm 
(Fig. 4a). The positively charged substrate translocation pathway can 
attract a nitrate molecule, which can enter the pore and bind directly 
above the two arginines at the substrate-binding site (Fig. 4b and 
Supplementary Fig. 4). There the nitrate forms hydrogen bonds with 
R305 and N175 (Fig. 4b). The binding event could then trigger a confor- 
mational change in NarK into the transient occluded state where the 
pore is closed both at the periplasm and cytoplasm (Fig. 4c). The 
conformational change could push the nitrate from above R89 and 
R305 to directly below, and as the transporter adopts the inward con- 
formation, its substrate translocation pathway opens to the cytoplasm 
and nitrate can then be released (Fig. 4d). As nitrate is released it is 
exchanged with nitrite. Nitrite enters the substrate translocation path- 
way and binds in plane with R89 and R305 (Figs 2d and 4e). The 
binding of nitrite at the substrate-binding site triggers the conforma- 
tional change of NarK from its inward-facing conformation into the 
outward-facing conformation via the transient occluded state (Fig. 4e, f). 
During this process nitrite is pushed directly above R89 and R305 and 
once NarK is facing outward the nitrite is released into the periplasm 
(Fig. 4a). The cycle of exchange can then continue. 

Such a rocker switch mechanism would require a hinge to allow the 
two halves of NarK to rock against one another as described above. We 
propose that the hinge is formed by the inter-domain hydrogen bonds 
involving the conserved arginine residues R89 and R305. As discussed 
above, these two residues are stabilized by inter-domain hydrogen 
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bonding: Y263 from the C-terminal domain and G171 from the 
N-terminal domain of NarK form hydrogen bonds with R305; and 
G144 of the N-terminal domain and $411 of the C-terminal domain of 
NarK form hydrogen bonds with R89 (Fig. 2c). As the substrate inter- 
acts with R89 and R305, it would disrupt the hydrogen bond networks 
between the two domains that are mediated by the arginines. Our 
structure of nitrite-bound NarK indicates that R305 undergoes a con- 
formational change upon substrate binding consistent with a break in 
the inter-domain hydrogen-bonding network (Fig. 2d). We propose 
that it is the breaking and reforming of these inter-domain hydrogen 
bonds through the arginines that allow the two halves of NarK to rock 
against one another as depicted in Fig. 4. Consistent with this postulate, 
G171, G144, S411 and Y263 are residues that are conserved in NNP 
members (Supplementary Fig. 2). 

The rocker switch mechanism proposed in other MFS members whose 
structures are known involves the breaking and the formation of salt 
bridges and hydrogen bonds between various protein residues'*""””*. 
As discussed above, NarK contains no acidic residues in its pore so salt 
bridges could only form between protein residues and the substrate, 
but salt bridges between various protein residues are unlikely to be 
involved in its mechanism of action. Instead, the pore of NarK is highly 
positively charged, probably to exclude protons and to attract anions 
like nitrate and nitrite, whereas the rocking seems to involve the 
breaking and formation of inter-domain hydrogen bonds at the sub- 
strate-binding pocket. It remains to be seen as more structures of MFS 
members from various families are determined whether other mem- 
bers use a similar pattern of hydrogen bond breaking and formation for 
their function. 


METHODS SUMMARY 


NarK from E. coli strain K12 was overexpressed in E. coli BL21 (DE3) C41. Fab 
antibody fragments were generated as described in Methods. The NarK-Fab 
complex was purified in the presence of 0.2% (w/v) n-decyl-B-b-maltoside and 
crystallized in the following condition: 0.1 M citric acid (pH3.5), 0.1 M NaCl, 
0.1M LipSO,4 and 28% PEG400. Nitrite-bound NarK-Fab crystal was obtained 
by soaking the NarK-Fab crystal in the buffer containing 50 mM sodium nitrite. 
Diffraction data sets of both crystals were collected at the Advanced Light Source 
(beamline 8.2.2). Data processing and structure determination were performed 
using the HKL2000, COOT and CCP4 programs. 


Full Methods and any associated references are available in the online version of 
the paper. 
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LETTER 


METHODS 

Protein expression and purification. The gene encoding full-length NarK from 
E. coli strain K12 was subcloned into pET15b (EMD Millipore) with a modified 
N-terminal 8X His-tag and a thrombin cleavage site. NarK was overexpressed in 
E. coli BL21 (DE3) C41 at 37°C for 4-5h with 0.3 mM IPTG as inducer. After 
induction, the cells were collected by centrifugation and resuspended in lysis buffer 
containing 20 mM Tris-HCl (pH 8), 150 mM NaCl and 1 mM PMSF. The resus- 
pension was then passed through a microfluidizer (Microfluidics Corporation) 
twice at ~15,000p.s.i., followed by centrifugation at 15,000g for 30 min. The 
supernatant was collected and centrifuged at 130,000g for 1 h. The pellets contain- 
ing membrane were resuspended in the same lysis buffer and frozen at —80°C 
until use. 

To purify NarK, an aliquot of frozen membrane was thawed and solubilized 
with 1% n-decyl-B-b-maltopyranoside (DM) at 4°C for 2h. After addition of 
20 mM imidazole followed by centrifugation, the supernatant was applied to Ni** 
nitrilotriacetate affinity resin (Ni-NTA). The resins were washed with 50 mM 
imidazole in the buffer containing 20 mM Tris-HCl (pH 8), 150 mM NaCl, 0.2% 
DM. Full-length NarK was then eluted with 250 mM imidazole in the same buffer. 
The His-tag was removed by thrombin digestion at an enzyme:protein molar ratio 
of 1:1,000 at 4 °C overnight. The enzyme-treated protein was further purified by gel 
filtration (Superdex-200) in 20 mM Tris-HCl (pH 8), 150 mM NaCl and 0.2% DM. 
NarK in the peak fractions was collected. 

Fab production. Standard protocol” was used to generate the mouse IgG mono- 
clonal antibodies against NarK. Western blot and native ELISA was performed to 
assess the binding affinity and specificity of the antibodies generated from hybri- 
doma cell lines. Several monoclonal antibodies with strongest binding affinity 
were then purified from the hybridoma supernatants by protein A affinity chro- 
matography. Fab was produced by papain digestion and purified by protein A 
affinity chromatography. 

Assembly of NarK-Fab complex. Purified NarK and Fab were mixed at a molar 
ration of 1:2, and incubated at 4 °C for 30 min. The complex was then concentrated 
and purified by gel filtration chromatography with 20mM Tris-HCl (pH8), 
150 mM NaCl and 0.2% DM. A clear and complete peak shift to higher molecular 
weight was observed, indicating homogeneous NarK-Fab complex formation. 
The purified protein complex was collected and concentrated to 5.8 mg ml’. 
Crystallization. Initial hanging-drop crystallization assay with purified NarK 
produced crystals grown in a large range of crystallization condition with PEG 
molecules. However, these crystals gave anisotropic diffraction to around 4 and 
6 A. High-quality crystals were obtained only when NarK was co-crystallized as a 
complex with Fab. The best crystal, which diffracted to ~2.5 A, was obtained with 
Fab prepared from hybridoma line 4G5 (IgG2a, kappa) in the following precip- 
itant condition: 0.1 M citric acid (pH3.5), 0.1M NaCl, 0.1M Li,SO, and 28% 
PEG400. Before data collection, the crystals were soaked in a cryoprotectant buffer 
containing 30% PEG400 in the same precipitant solution for 5 min, and rapidly 


frozen in liquid nitrogen. For soaking, the NarK crystals were transferred into 
a precipitant solution containing 50 mM sodium nitrite or sodium nitrate over- 
night, and then frozen as above. Nitrite-bound-NarK crystals diffracted to ~2.8 A 
resolution. 
Data collection and structure determination. The data sets were collected at the 
Advanced Light Source (beamline 8.2.2), and processed with HKL2000”* to 2.6 A 
(substrate-free NarK) and 2.8 A (nitrite-bound NarK) resolution. Further structure 
determination and refinement were accomplished using the CCP4 package”. 

The structure was determined by molecular replacement using the program 
Phaser”’ with a polyalanine Fab fragment derived from the Protein Data Bank 
(PDB ID 1F8T’’) as a search model. Phases from molecular replacement were 
significantly improved after cycles of density modification using the program 
Parrot”, and the electron density for the 12 transmembrane helices became appar- 
ent. Manual model building was carried out for the Fab using the program Coot”, 
followed by structure refinement using the program Refmac’’. To facilitate model 
building of NarK, 20-residue polyalanine helices were placed in the asymmetric 
unit by molecular replacement. Subsequent cycles of density modifications, model 
building and refinement were carried out until structure completion. The final 
model contains one molecule of NarK (residues 12-458) and one molecule of the 
heavy and light chains of the Fab in the asymmetric unit. Data collection and 
refinement statistics are presented in Supplementary Table 1. 

All figures in this paper were prepared with Chimera version 1.6.2** or Pymol 
version 1.5** and assembled in Photoshop CS6 (Adobe). Supplementary Fig. 2 was 
prepared using the program Clustal X**. 
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CORRECTIONS & AMENDMENTS 


ERRATUM 
doi:10.1038/naturel2196 


Erratum: Basic amino-acid side 
chains regulate transmembrane 
integrin signalling 

Chungho Kim, Thomas Schmidt, Eun-Gyung Cho, Feng Ye, 
Tobias S. Ulmer & Mark H. Ginsberg 


Nature 481, 209-213 (2012); doi:10.1038/nature10697 


The legend to Fig. 1d of this Letter incorrectly referred to “a POPS 
lipid’s amino NH;* group (blue)”. It should read “a POPC lipid’s 
choline N(CH3)3_ group (blue)”, where POPC is 1-palmitoyl-2- 
oleoyl-sn-glycero-3-phosphocholine. The original legend to Fig. 1d 
has been corrected. 


RETRACTION 
doi:10.1038/naturel12164 


Retraction: Branched tricarboxylic 
acid metabolism in Plasmodium 


falciparum 


Kellen L. Olszewski, Michael W. Mather, Joanne M. Morrisey, 
Benjamin A. Garcia, Akhil B. Vaidya, Joshua D. Rabinowitz 
& Manuel Llinas 


Nature 466, 774-778 (2010); doi:10.1038/nature09301 and 
corrigendum Nature 469, 432 (2011); doi:10.1038/nature09712 


We retract this Letter, which reported both reductive and oxidative 
tricarboxylic acid (TCA) metabolism in Plasmodium falciparum para- 
sites (‘branched TCA metabolism’). The data for metabolic labelling 
of TCA intermediates remain reliable, but we have come to realize that 
the interpretation presented is incorrect. Although there is both reduc- 
tive and oxidative TCA cycle flux in P. falciparum-infected red blood 
cell (RBC) cultures (as we reported), new data from the Llinas and 
Vaidya groups (manuscript in preparation) suggests that the reductive 
flux occurs primarily in the RBCs and not in the parasite itself. 
Specifically, we have used new enrichment strategies for the parasitized 
RBCs that enhance our ability to measure P. falciparum-infected RBC 
metabolic activity without excessive interference from surrounding 
uninfected RBCs. On feeding '*C; glutamine, we measured both 
13C,-succinate and ‘*C,4-succinate in the infected RBCs, demonstrat- 
ing that TCA metabolism in blood-stage P. falciparum is not branched 
but primarily oxidative (cyclic). 
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The right tool 


Research-tool developers must be creative, innovative and 
willing to collaborate with people from a variety of fields. 


BY KELLY RAE CHI 


regory Buckner once dreamed of being 
(F: surgeon. But his father, a civil engi- 
neer, convinced him to try engineer- 
ing. Now, as a mechanical engineer at North 
Carolina State University in Raleigh, Buck- 


ner combines the two interests. He creates, 
builds and develops biomedical tools such as 


robotic catheters for use in minimally invasive 
cardiothoracic surgery. Buckner does not work 
directly with chest pain or lacerations. Instead, 
he deals with batteries and circuit boards and 
plastic tubing. 

Tools from lab instruments to software can 
refine research and open new fields of enquiry. 
And the people who make them are a varied 
bunch — they might be full-time university 
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researchers, or they might be hired by larger 
teams in academia or industry. The boundaries 
are fluid. Buckner, for example, has worked as a 
consultant, licensed his inventions to industry 
and even launched his own company. 

Industry jobs involve specific goals with 
fast turnarounds. They are often at companies 
that sell scientific instrumentation, medical 
devices or software, such as Life Technologies, 
based in Carlsbad, California; Oxford Instru- 
ments in Abingdon, UK; and Medtronic in 
Minneapolis, Minnesota. Options are many 
for those keen on focusing their skills and 
scientific know-how on tools rather than lab 
research. 

“There's certainly a diversity of ways to 
impact science through tool building,” says 
Eric Betzig, a group leader at the Howard 
Hughes Medical Institute’s Janelia Farm 
Research Campus in Ashburn, Virginia, who 
at Janelia and elsewhere has led the develop- 
ment of microscopes that can image features 
smaller than half the wavelength of the light 
used to illuminate them. Often overlooked in 
the past, designing and building biomedical 
tools has been receiving growing respect as a 
career track. There are now grants and publica- 
tions dedicated to the craft, and toolmakers are 
increasingly integrated into research teams to 
help not only with the development of tools, 
but also with their application. Although they 
may not be responsible for the actual findings, 
toolmakers can help to overcome research 
obstacles in the lab or the clinic. 


ALL WALKS OF LIFE 
Successful tool builders typically have exper- 
tise in fields such as engineering, computer 
science, bioinformatics, maths or physics — 
and occasionally biology. They also tend to 
have a willingness to learn, a knack for pick- 
ing up mechanical and research skills and 
enthusiasm for collaboration. Members of 
Janelia Farm’s instrument design and fabrica- 
tion group, for example, have worked in the 
defence and automotive industries. They learn 
neuroscience on the job at Janelia, says Chris 
Werner, the group’s shared-resource direc- 
tor. Tool builders shadow scientists in the 
laboratory until they understand the problem 
at hand, and they may even be named as co- 
authors on publications. “Besides technical 
skills, what’s really important and makes the 
difference is their soft skills — communica- 
tion, giving credit, taking credit — the ability 
to play nice,” says Werner. 

Toolmakers need to be good listeners, > 
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> alert to the challenges of bench research — 
whether those are imaging a live cell or analys- 
ing a huge number of data. Developing tools to 
solve such problems involves making multiple 
test runs and repeatedly conferring with users 
or customers to make sure that the instru- 
ment or algorithm is doing what it needs to. 
This can be a big challenge — often researchers 
are unsure of exactly what they need. And suc- 
cess can mean being anchored to a project for 
a long time; tool builders frequently get roped 
into collaborations and companies centred on 
their technologies. 

Armon Sharei, a PhD student in chemical 
engineering at the Massachusetts Institute of 
Technology in Cambridge, saw at first hand 
the healthy dose of trial and error that can be 
involved in tool development. Sharei embarked 
on the path to an invention by chance, when he 
was trying to shoot cells with a jet of liquid to 
deliver proteins, DNA, nanoparticles and other 
large molecules into the cells for research and 
clinical applications. The delivery worked, but 
not for the reasons that he expected. Instead 
of directly injecting the membrane with fluid, 
the jet seemed to be 
squashing the cells 
— squeezing them so 
much that it caused 
their membranes to 
open temporarily 
and allow the fluid to 
diffuse in. But deliv- 
ery efficiency was 
poor. “The prospects 
didn't look good but 
what kept us going 


_p 


is that, in principle, “That lucky first 
this method was so observation 
simple it couldmake OF experiment 
abig differenceifwe comes to 
could improve it? everyone. It’s 
says Sharei. just amatter of 
His researchadvis- recognizing it 
erstookariskandthe and developing 


group overhauled 
the design. The team 


its potential.” 
Armon Sharei 


eliminated the jet and 
created different versions of a microfluidic 
chip that worked by essentially squeezing the 
cells. The team patented the technology and 
in March launched SQZ Biotech in Boston, 
Massachusetts, to commercialize the device. 
Applications might include delivering repro- 
gramming proteins to convert fully developed 
cells into induced pluripotent stem cells. “I 
think that lucky first observation or experi- 
ment comes to everyone once in a while. It’s 
just a matter of recognizing what it is and 
developing its potential,” says Sharei. 
Serendipity aside, tool ideas often come 
from a deep understanding of customers’ 
needs, which provides insight into ways 
to make improvements. Ideas for most of 
Buckner’s developments — which include a 
finely controlled robotic catheter and a chest 
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retractor that measures force during open- 
heart surgery — come from watching sur- 
geons, cardiologists and radiation oncologists. 
“They are true experts in the field and know 
what the technical hurdles of their current 
practices are,” says Buckner, who has patents 
on multiple inventions. 

Tool development itself often spawns further 
ideas; one project may proliferate into many. 
Through a clinical collaboration in 2007-08, 
Buckner’s group was working to automate the 
tying of knots in sutures during surgery on 
the mitral valve of the heart. The researchers 
had to test their prototypes on live pigs, which 
proved both costly (at US$2,500 per pig) and 
time consuming. So they developed a ‘dynamic 
heart system’ that pumped fluid through a 
dead pig heart so that it functioned realisti- 
cally. The device allowed the team to test its 
technologies for less than $25 a go. 


SILVER LINING 

Toolmakers must be ready for some disap- 
pointment — and if the end product fails to live 
up to the original idea, they should be nimble 
enough to tweak it, find other applications or 
move on. Betzig experienced difficulties with 
near-field microscopy, a high-resolution imag- 
ing method. The technique turned out to have 
some physical limitations that made it less than 
ideal for its intended use in imaging biological 
samples. In the end, he moved on to differ- 
ent projects, although others have continued 
to pursue near-field microscopy. “The thing 
about tools is that they’re kind of like your 
children,” says Betzig. “When they’re born, 
you say, ‘Oh, he could be president! Or an 
astronaut! Cure cancer!” But toolmakers often 
come to the realization that their work might 
not have quite the impact that they antici- 
pated. Betzig says that his tools have failed in 
various ways — and that by focusing on the 
flaws, he can find opportunities to develop 
better versions. 

Once a tool builder based in academia 
believes that he or she has something ready 
for the marketplace, another set of challenges 
awaits. In the United States, for example, the 
developer must often file an invention disclo- 
sure with his or her university’s technology- 
transfer office, allowing the office to evaluate 
the potential for commercializing the tool. If it 
looks promising, the office then usually files a 
provisional patent application. If certain condi- 
tions specified by the US Patent and Trademark 
Office are met within 12 months, the applica- 
tion can be made non-provisional. 

After a patent is issued, there are numerous 
possibilities. The inventor might seek funds 
from industry or the government to refine 
the prototype. The technology-transfer office 
might work with the inventor to find com- 
panies that are willing to license it. Any tools 
geared for clinical use will need to be submitted 
for regulatory approval. And the inventor could 
launch a company with the help of partners or 
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Mechanical engineer Gregory Buckner (right) 
works with a graduate student. 


advisers who have business and legal exper- 
tise. After tools are sold, new applications 
might come to light, and the company can 
develop these or create spin-offs to do so. 


BUILDING SKILLS 

Just like researchers, toolmakers can learn 
through an apprenticeship of sorts. “As a 
more-junior graduate student, one can learn 
these skills very deeply by working with 
more-senior graduate students, postdocs and 
principal investigators,” says Samuel Hess, a 
physicist at the University of Maine in Orono 
who has invented high-resolution imaging 
methods. Degrees in fields such as biomedi- 
cal engineering or computational biology can 
help people to build skills at the intersection 
of toolmaking and medicine or biology. Even 
after obtaining their PhDs, toolmakers can 
learn or refine their skills by working with 
more-experienced researchers during a 
postdoctoral fellowship, on sabbatical or in 
industry. Whenever it happens, learning to 
build is a hands-on process — with a lot of 
mistakes made along the way, says Betzig. 

Many established toolmakers are essen- 
tially self-taught and self-motivated. “I hire 
lots of programmers and engineers. The best 
programmers are always former biologists 
who pick up the programming bug,” says 
Chris Beecher, an analytical chemist and co- 
founder of IROA Technologies in Ann Arbor, 
Michigan, who has invented platforms that 
quantify the small molecules produced by 
metabolism in cells or tissues. 

At Waters Corporation, an analytical science 
instrumentation company based in Milford, 
Massachusetts, some of the product devel- 
opers are not engineers, but scientists who 
have previous experience using analytical 


chemistry instruments as part of their research, 
so they know what customers need, says Steven 
Cohen, life-sciences director in research and 
development at the company. Such develop- 
ers work with a manufacturing workshop to 
design instruments and build prototypes. 

In academia and the non-profit sector, 
tool development is often funded through a 
larger research project. But researchers can 
also apply for grants specific to tool building. 
Genome Canada in Ottawa, for example, sup- 
ports the development of genomic technolo- 
gies through funding competitions. Naveed 
Aziz, the organization's director of technology 
programmes, says that he receives funding 
applications led by bioinformaticians as often 
as by researchers. And the US National Sci- 
ence Foundation (NSF) runs an Instrument 
Development for Biological Research grant 
to support development of tools that enable 
new research capabilities, markedly improve 
current technology or transform a prototype 
into something more broadly usable. 

Several US federal agencies — the NSE, the 
National Institutes of Health and the Depart- 
ment of Defense — also offer Small Business 
Innovation Research and Small Business 
Technology Transfer grants, which aim to 
spur technological innovation and lower the 
barriers to commercialization. The Well- 
come Trust in London, one of the world’s 
biggest biomedical research charities, uses 
its Translation Fund and Health Innovation 
Challenge Fund to speed the commercializa- 
tion of biomedical 
technologies. 

Grants like these 
can help toolmak- 
ers make a lasting 
difference to a given 
field. It is always 
nice to go “beyond 
the narrow scope” 
of the original pro- 
ject, notes Charles 
Schmitt, director of 
informatics at the 


RenaissanceCom- | One can learn 
puting Institute, these skills by 
based in Chapel Working with 
Hill, North Caro- more-sentor 
lina. But although graduate 

itis gratifying when students, 

a tool is adopted postdocs and 
widely, itsometimes principal 
means that tool- investigators. ‘“ 
makers must train Samuel Hess 


others to use their 


inventions, and must surrender control to 
groups with their own skill sets and agendas. 
Toolmakers can take pride in their impact, but 
lose command of their brainchild. Moving on, 
says Schmitt, “is always a big challenge”. m 


Kelly Rae Chi is a freelance writer based in 
Cary, North Carolina. 
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CHEMISTRY 
Improving lab safety 


A US National Research Council 
committee met on 15-16 May in 
Washington DC to start developing 

safety recommendations for chemical 
researchers in academic and national 
laboratories. The project was spurred in 
part by three high-profile accidents at US 
academic laboratories between 2008 and 
2011. Behavioural scientists, chemists 
and safety experts will identify problems 
at research institutions and learn from 
effective safety systems in industry to 
develop the recommendations, which 
will be released next spring. Chemists and 
behavioural scientists “can learn from each 
other,’ says committee chairman Holden 
Thorp, chancellor of the University of 
North Carolina at Chapel Hill. 


BRAIN DRAIN 


Workers flee corruption 


A growing number of highly skilled 
workers including researchers are leaving 
corrupt nations where government officials 
demand bribes and control access to the 
labour market, finds a study published on 
17 May (A. Ariu and M. P. Squicciarini 
EMBO Rep. http://doi.org/mkh; 2013). 
Nations with relatively low corruption 
benefit from an influx of scientists who 
write influential papers and patents 

and create businesses, says the study, 
which examined movement patterns 

in 123 nations against an international 
corruption index. “It is not a positive thing 
for a researcher to be in a country that is 
highly corrupt,’ says study co-author Mara 
Squicciarini, an economist at the Catholic 
University of Leuven in Belgium. 


IMMIGRATION 
Postdocs hit by scam 


Confidence tricksters are targeting 
international postdocs and students in the 
United States, according to complaints 
passed on to the National Postdoctoral 
Association (NPA) in Washington DC. 
Telephone callers claiming to be from US 
Citizenship and Immigration Services 
(USCIS) tell trainees that they have 
improperly completed a form, and that 
they face a fine or deportation. This is not 
agency procedure, says the NPA. Callers 
know the target’s name, date of birth, 
address, phone number and case number. 
People who suspect that they have been 
targeted should report the scam at 
go.nature.com/poexjx. The USCIS did not 
respond to requests for comment. 
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Ua SCIENCE FICTION 


A TIME FOR PEACE 


BY S. R. ALGERNON 


CC hy do you deserve a place here, 
Oscar?” asked the director, as 
he eyed Oscar from across a 


sturdy oak desk. A painting of Speer’s Volker- 
bundhalle and the Berlin skyline dominated 
the wall behind him. 

Oscar looked down at his fidgety knees to 
avoid the director's glare. He had not pre- 
pared himself for silence. He had imagined 
the Institute as a place where the air danced 
with the roar of engines and the crackle ofa 
welder’s torch, or at least the clack of chalk 
against slate. 

Nobody deserved a place at the Institute, 
thought Oscar. The Institute plucked them 
from the world like the tip of a tornado and 
hurled them into the maelstrom of history. 
Fermi put his studies on hold for three years 
and emerged in 1924 with a complete under- 
standing of nuclear fission. Alan Turing took 
a summer off from Sherborne and crossed 
the pond to New Jersey. By 1935, the first 
business computers rolled off the produc- 
tion line. 

Ten seconds passed. The director's office 
was quiet, except for the steady sweeping of 
a broom and the snip of hedge trimmers out- 
side the window. 

Oscar swallowed and tooka breath. 

“Tt all comes down to quantum tunnel- 
ling. We’ve always assumed that time travel 
involved moving through space-time in 
accordance with Dr Einstein's theories, but 
if Dr Bornis right...” 

“Tt is not a question of if,” said the direc- 
tor. He reached into his desk and pulled 
out a photograph. “This was our founding 
conference, after the Great War. Look in the 
third row.” 

An inscription in the upper left cor- 
ner read POTSDAM, 1919. Just below it, 
between Heisenberg and Rutherford, Oscar 
saw a bearded version of himself. 

“I was there?” 

“You and your boxcar full of journals and 
blueprints. You convinced Heisenberg, and 
he convinced the rest. Together, you drew up 
the plans for the twentieth century.” 

“So... lam, or will be, the first time trav- 
eller?” 

“The Oscar that appeared in Potsdam 
50 years ago is a matter of historical record. 
I want to know who you are. Are you — the 
you that sits in my office — necessary to 
move history onto its present path?” 

“The grandfather paradox,” said Oscar, 
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Future plans. 


with a chuckle. “It’s not a problem. Moving 
through time implies velocity and causa- 
tion. Quantum tunnelling through time is 
different. The time traveller simply appears 
ex nihilo from the quantum foam. The trav- 
eller’s emergence on the scene is, to be sure, 
an improbable event, but it has no destiny 
and it owes nothing to the future.” 


“So, you do not have a destiny?” 

“Maybe not,’ said Oscar, “but I hope you'll 
agree that I still have potential. I will find a 
way to unwind time to an earlier state. The 
history books tell us that Brown and Reitsch 
walked on the Moon in 1947. What if it were 
Lindbergh in 1927 or da Vinci in 1497?” 

The director stroked his chin. 

“And what about all the people in this 
timeline? Will they agree to wink out of 
existence to create your perfect world?” 

“Once the timeline is unwound, everyone 
here will be unrealized potential. They won't 
geta choice. After all, nobody gets to vote on 
what time it is.” 

“But you do. Or, you will. Doesn't that 
make you responsible for the people you 
leave behind?” 

“What can I do for them? They won't even 
exist. Am I obliged to consider every poten- 
tial human life, not just every embryo but 
every possible union of sperm and egg?” 

“Perhaps not,” said 
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intensive. Have you explained to your friends 
and relatives that you will be quite busy for 
some time and should not be disturbed?” 
“Of course.” 
“Good, said the director. He stood up and 
offered his hand. “Welcome. I will show you 
to your room.” 


The room, at the other end of the quad, was 
sparse and tidy, like a monk's cell. A grey 
jumpsuit lay neatly on the bed. At first, the 
sight startled him. Did Turing have to sweep 
the paths when he was here? Did Fermi have 
to trim the hedges? 

Oscar stepped tentatively up to the win- 
dow. Outside, a man sat with his back to 
Oscar, facing a watercolour of a Bavarian 
castle against the backdrop of an idyllic 
Wagnerian landscape. Oscar tried to lean out 
the window to get the artist’s attention, but it 
opened only a few centimetres, not enough 
for him to squeeze his head through. 

“Hallo, nachbar,” said Oscar, eager to try 
out a little German. 

The artist slowly turned, without lowering 
his brush. 

“Ja; said the painter, without inflection, 
looking vaguely towards Oscar's window. 
“Hallo, nachbar,? His head drifted back 
towards the canvas, and he added a few 
brushstrokes. 

Oscar tooka step back, unimpressed. He 
had expected to meet the next Einstein or 
Heisenberg, not some undersized Charlie 
Chaplin. He turned back to the director. 

“There must be some mistake,” said Oscar. 
“Tm no painter or gardener. What about my 
future? What about history?” 

The old man’s smile drew the skin taut 
over his face. 

“You may owe this world no debt, but we 
have our own obligations. I’m sure you'll 
understand if we prefer to leave the timeline 
as it is” 

Oscar stepped towards the door, but the 
director slammed it shut. On the other side, 
a deadbolt slid firmly into place. 

Oscar staggered backwards to the bed, 
hearing the sound of his own quickened 
heartbeat and the even cadence of pruning 
shears. After each slice, branches struck the 
pavement and waited for the broom to sweep 
them away. m 
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The role of epistasis in protein evolution 


ARISING FROM M. S. Breen, C. Kemena, P. K. Vlasov, C. Notredame & F. A. Kondrashov Nature 490, 535-538 (2012) 


An important question in molecular evolution is whether an amino 
acid that occurs at a given site makes an independent contribution to 
fitness, or whether its contribution depends on the state of other sites 
in the organism’s genome, a phenomenon known as epistasis’°. Breen 
and colleagues recently argued® that epistasis must be “pervasive 
throughout protein evolution” because the observed ratio between 
the per-site rates of non-synonymous and synonymous substitutions 
(dN/dS)’ is much lower than would be expected in the absence of 
epistasis. However, when calculating the expected dN/dS ratio in the 
absence of epistasis, Breen et al.° assumed that all amino acids observed 
at a given position in a protein alignment have equal fitness. Here, we 
relax this unrealistic assumption and show that any dN/dS value can in 
principle be achieved at a site, without epistasis; furthermore, for all 
nuclear and chloroplast genes in the Breen et al. data set, we show that 
the observed dN/dS values and the observed patterns of amino-acid 
diversity at each site are jointly consistent with a non-epistatic model of 
protein evolution. 

For a variety of proteins under purifying selection, Breen et al. 
constructed alignments and recorded the amino acids observed at 
each position; these observed amino acids were deemed “acceptable” 
with respect to natural selection. They then assumed that substitu- 
tions occur at neutral rates among the acceptable amino acids in order 
to calculate, for each protein, an expected value for dN/dS in the 
absence of epistasis. Because their empirical observations of dN/dS 
were much lower than these expected values, Breen et al.° concluded 
that epistasis must be extremely prevalent. 

The flaw in this reasoning is that Breen et al.° considered only a 
single class of fitness assignments, so that all amino acids observed at a 
site were assumed equally fit. A more realistic assumption is that some 
amino acids observed at a site are more fit than others*”. 

To illustrate the principle that low dN/dS can arise without epistasis, 
we considered a non-epistatic model in which, among the acceptable 
amino acids at a given site, one of these is preferable to the rest. We 
performed the following experiment: in a hypothetical protein of 
length 300 amino acids, for each position we randomly designated 
eight amino acids as acceptable (the average number of acceptable 
amino acids reported by Breen et al.°), but gave one of these a selective 
advantage over the rest. We then calculated the equilibrium dN/dS 
(ref. 10) for this protein as a function of the selective advantage 
of the preferred amino acid, 2Ns (Fig. 1). Whereas dN/dS is high for 
the case 2Ns = 0, corresponding to the Breen et al.° assumption, dN/dS 
is much lower for larger 2Ns. Thus, a large range of dN/dS values are 
consistent with non-epistatic models of protein evolution. 

Although non-epistatic models can in principle produce low dN/dS 
values (Fig. 1), can such a model account for the Breen et al.° data? To 
answer this question, we considered a more general non-epistatic 
model that assigns to each amino acid at a site a different fitness. 
For each gene in the Breen et al.° data set, we assigned fitnesses at each 
site in such a way that the resulting equilibrium distribution of amino 
acids under our model precisely matches the amino-acid frequencies 
observed for that site!’. Furthermore, as a result of these fitness assign- 
ments, the asymptotic mean pairwise sequence divergence under our 
model necessarily matches the mean pairwise divergence observed in 
the data (Table 1 and Methods). 

Using this model, for each gene in the Breen et al.° data set we 
repeatedly simulated the evolution of a pair of sequences from their 
common ancestor and computed dN/dS. For the 13 mitochondrial 
genes, the average simulated dN/dS values, although substantially 


lower than the Breen et al.° expectations, are still greater than the 
empirically observed values (Table 1). However, for the three nuclear 
and chloroplast genes in the Breen et al.° data set, the average dN/dS 
values under our non-epistatic model are comparable to or even lower 
than the empirical dN/dS values Breen and colleagues reported. Thus, 
the dN/dS values observed in these genes need not be attributed to 
epistasis, but rather can be explained by the more parsimonious 
assumption that the various amino acids observed at a site have different 
fitnesses. 

It is important to note that the effects of natural selection and 
phylogeny are confounded in the amino-acid frequencies observed 
at each site, and therefore in our fitness estimates. Although methods 
exist to disentangle these effects when the phylogeny is small and 
known'*”’, there is no well-accepted phylogeny for the vast range of 
taxa studied by Breen et al°. Nonetheless, whatever the true phylogeny 
may be, under the standard assumption that molecular evolution can 
be modelled as an equilibrium Markov chain (see, for example, ref. 14, 
as used by Breen et al.°) our fitness estimates are maximum likelihood. 
Relaxing this assumption, or allowing more complex models (for 
example, allowing fitnesses or population sizes to vary across time 
or clade), would make it only more difficult to reject the non-epistatic 
null hypothesis. 

In summary, Breen et al.° provide no direct evidence of epistasis, 
nor do they reject the full space of non-epistatic models. They have 
analysed only three non-mitochondrial genes, whose evolutionary 
patterns, we have shown, can be explained without epistasis. Although 
Breen et al.° contend that epistasis is the primary factor in all of 
molecular evolution, further work is needed to substantiate this claim. 
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Figure 1 | Non-epistatic models of protein evolution can produce low dN/ 
dS values. Expected dN/dS as a function of 2Ns for a hypothetical protein of 
length 300, in which eight acceptable amino acids are chosen at random for 
each position and one of these amino acids at random is assigned a selective 
advantage of size 2Ns. The remaining 12 amino acids are lethal. The Breen et al.° 
expectation for dN/dS in the absence of epistasis corresponds to 2Ns = 0. 
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Table 1| Observed and expected dN/dS values 


Gene Breen et al.° expected 


Our average simulated 


Breen et al.° empirical Our equilibrium mean Breen et al.° empirical 


dN/dS dN/dS dN/dS pairwise divergence pairwise divergence 
Mitochondrial 
ATP6 0.44 0.215 0.056 0.332 0.332 
ATP8 0.56 0.624 0.224 0.615 0.615 
COX1 0.28 0.078 0.015 0.188 0.188 
COX2 0.43 0.140 0.025 0.348 0.348 
COX3 0.32 0.144 0.036 0.290 0.290 
CYTB 0.51 0.117 0.039 0.242 0.242 
ND1 0.39 0.208 0.040 0.383 0.383 
ND2 0.51 0.262 0.067 0.398 0.398 
ND3 0.49 0.242 0.069 0.379 0.379 
ND4 0.42 0.239 0.045 0.433 0.433 
ND4L 0.49 0.369 0.076 0.502 0.502 
ND5 0.32 0.211 0.057 0.407 0.407 
ND6 0.42 0.397 0.073 0.554 0.554 
Nuclear 
EEF1A1 O11 0.031 0.020 0.080 0.080 
H3.2 0.14 0.014 0.037 0.019 0.019 
Chloroplast 
rbcL 0.40 0.024 0.072 0.056 0.056 


Comparison of expected dN/dS values and mean pairwise divergence with the empirical values for each gene in the Breen etal.®° data set. The Breen et al. expected dN/dS is based on the assumption that all amino 
acids observed at a given site are neutral relative to each other. Our expected dN/dS is based on the assumption that the various amino acids observed at a site have different fitnesses. 


METHODS 


We assume that each codon evolves according to an independent Markov chain, 
the rate matrix of which is determined by the scaled selection coefficient assigned 
to each amino acid’. The equilibrium frequency of each amino acid is then 
proportional to uje”* (ref. 15), in which u; is the number of codons that code 
for amino acid i, and 2Ns(i) is its scaled selection coefficient. After assigning site- 
specific fitnesses to amino acids, 1,000 simulations were conducted for each 
protein, as follows. For each site represented in at least half the sequences from 
the Breen et al.® alignment, an ancestral codon was drawn from the equilibrium 
distribution of our Markov chain. Two copies of this ancestral sequence were then 
evolved independently until dS = 0.25, which is within the range of dS = 0.05 to 
0.5 used by Breen et al®. We then estimated dN/dS for each pair using PAML™, 
again following the procedure of Breen et al. Mean pairwise divergence (Table 1) 
was calculated using the formula : Oo, (1 - pau ‘ i ) , in which fj, 
denotes the frequency of amino acid i at site j, and L the number of majority 
non-gapped sites in the protein. All computer code is available on request. 
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REPLYING TO D. M. McCandlish, E. Rajon, P. Shah, Y. Ding & J. B. Plotkin Nature 497, http://dx.doi.org/10.1038/nature12219 (2013) 


Understanding fitness landscapes, a conceptual depiction of the genotype- 
to-phenotype relationship, is crucial to many areas of biology. Two 
aspects of fitness landscapes are the focus of contemporary studies of 
molecular evolution. First, the local shape of the fitness landscape 
defined by the contribution of individual alleles to fitness that is inde- 
pendent of all genetic interactions. Second, the global, multidimensional 
fitness landscape’ shape determined by how interactions between 
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alleles at different loci change each other’s fitness impact, or epistasis. 
In explaining the high amino-acid usage (u), we focused on the global 
shape of the fitness landscape’, ignoring the perturbations at individual 
sites’. 

u is overly sensitive to the presence of rare amino acids’, including 
fixed slightly deleterious amino acid—a factor we did not consider. 
McCandlish et al.’ apply a non-epistatic model with fitness landscape 


©2013 Macmillan Publishers Limited. All rights reserved 


BRIEF COMMUNICATIONS ARISING 


ruggedness on the local scale of individual sites. As a null model for 
testing the hypothesis of an epistasis-free fitness landscape, it directly 
confirms our conclusion that epistasis affects most amino-acid sub- 
stitutions for 13 out of 16 genes considered’. The other three genes are 
extremely conservative with low density and frequency of emerged 
amino-acid states in the multiple sequence alignment. In the non- 
epistatic model’ such amino-acid states seem to be substantially dele- 
terious, leading to the low predicted dN/dS values in these three genes, 
with the largest effect in rbcL. 

In the absence of epistasis, strong selection against non-optimal 
states markedly decreases the equilibrium sequence divergence** and 
the expected time to reach the equilibrium divergence*. The model’ for 
rbcL simulates an equilibrium sequence divergence of ~0.06, which 
must be independent of phylogenetic distance beyond closely related 
clades*. Both of these predictions are easily falsified. Orthologous rbcL 
sequence divergence shows no sign of reaching a true equilibrium even 
between phylogeneticaly distant clades (Fig. 1), whereas a BLAST 
search reveals that sequence divergence between Arabidopsis thaliana 
and cyanobacterial orthologues reaches values greater than 0.16. 

Generally, the non-epistatic model has a trade-off between the 
strength of selection against suboptimal alleles and the expected 
sequence divergence, which rapidly reaches its equilibrium value*”. 
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Figure 1 | Sequence divergence as a function of phylogenetic distance. 
Average sequence divergence for pairwise comparisons of A. thaliana RbcL 
protein sequence and other orthologous sequences used in ref. 2. 


For extremely conservative genes, such as the selected three non- 
mitochondrial genes considered’, the non-episatatic model can give 
the appearance of avoiding this trade-off, which breaks down when 
the long-term evolutionary predictions of the model are considered in 
detail (Fig. 1). 

Two aspects of protein evolution are revealed by sequence similarity 
searches. First, protein sequence divergence occurs slowly, slower than 
neutral divergence. Second, sequence divergence is proportional to 
phylogenetic distance and is usually substantial for sequences from 
distantly related species. Non-epistatic models** that consider only 
local fitness landscape ruggedness are inconsistent with both of 
these basic and universal features of protein evolution. By contrast, 
our claim that epistasis—the global, multidimensional shape of the 
fitness landscape—is the primary factor of protein evolution explains 
the high amino-acid usage” and how slow long-term sequence diver- 
gence leads to highly dissimilar sequences’. Models that take into 
account both local and global aspects of fitness landscapes could lead 
to better quantification of factors shaping molecular evolution, although 
their development may be hampered by inherent complexity of multi- 
dimensional fitness landscapes’ and subtle local confounding factors**. 
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